Distributed Training & Inference Optimization Engineer (Kolkata)
-
Kolkata, India
-
Posted: a week ago
-
Save
- Enhance performance of distributed training frameworks such as PyTorch, Deep Speed, or similar systems
- Identify and resolve bottlenecks in large-scale training pipelines (e.g., memory usage, communication overhead, GPU utilization)
- Optimize inference systems using techniques like quantization, caching, and batching to achieve low latency and high throughput
- Collaborate with infrastructure and platform teams to improve resource orchestration, scheduling, and system reliability
- Design benchmarking tools and metrics to measure training efficiency, system throughput, and latency performance
- Apply advanced optimization techniques (e.g., mixture-of-experts, speculative decoding, model parallelism) to improve large model performance
- Continuously evaluate new approaches to hardware acceleration and model execution efficiency Required Qualifications
- 3+ years of hands-on experience optimizing GPU-based machine learning workloads
- Strong expertise in deep learning frameworks such as PyTorch, Deep Speed, or equivalent
- Experience with distributed training techniques for large-scale models
- Solid understanding of inference optimization strategies (e.g., quantization, pruning, caching, batching)
- Degree in Computer Science, Engineering, or a related technical field Preferred Qualifications
- Experience with CUDA programming and GPU performance profiling tools
- Familiarity with distributed systems communication libraries and optimization techniques
- Knowledge of model optimization methods such as Flash Attention, LoRA, or similar techniques
- Experience working with containerized or orchestrated environments for ML workloads
- Contributions to open-source machine learning or infrastructure projects
- Hands-on experience with modern inference serving frameworks Apply on Kit Job: kitjob.in/job/4lxrm2
-
Company nameGoogle
-
Job positionDistributed Training & Inference Optimization Engineer (Kolkata)
Distributed Training & Inference Optimization Engineer (Kolkata) has been posted in the Kolkata Transportation & Logistics category on Locanto.
If you’re still wanting to browse, there is so much to explore in the Transportation & Logistics category! Take a look at the ads digital marketing classes, Kolkata, CDL A Local Delivery Truck Driver (R226194), Salt Lake City and CDL A Local Delivery Truck Driver (R232865) in 9494 South Prosperity, Salt Lake City to discover more of what you’re looking for. Currently, there are 6 ads posted in the Transportation & Logistics category in Kolkata.
You can find the Transportation & Logistics category under Jobs. Want something else? Check out the related categories Engineering, Part Time Jobs & Side Jobs and Marketing, Advertising & PR Kolkata.
Interested in more? Widen your search to view ads in nearby areas of Kolkata. This includes Transportation & Logistics in Sibpur, Pātipukur and South Dumdum. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.