Distributed Training Pune
-
Pune, India
-
Posted: less than a week ago
Join an AI infrastructure team that builds and optimizes large-scale machine learning systems. The team uses current tooling to support high-performance experimentation, scalable model deployment, and efficient processing of large datasets.
The team operates globally, bringing together engineers and researchers to push the boundaries of deep learning, distributed systems, and next-generation compute platforms.
About the Role
This position is centered on maximizing the efficiency and scalability of GPU-based machine learning workloads, particularly for large language models (LLMs) and generative AI systems.
You will work on improving both training performance and inference efficiency, ensuring optimal utilization of hardware resources, reduced latency, and faster model iteration cycles. The role requires hands-on expertise in deep learning frameworks, distributed systems, and performance optimization.
Key Responsibilities
Improve the performance of distributed training frameworks and libraries such as PyTorch, DeepSpeed, or similar systems
Identify and resolve bottlenecks in large-scale training pipelines (e.g., memory usage, communication overhead, GPU utilization)
Optimize inference systems using techniques like quantization, caching, and batching to achieve low latency and high throughput
Collaborate with infrastructure and platform teams to improve resource orchestration, scheduling, and system reliability
Design benchmarking tools and metrics to measure training efficiency, system throughput, and latency performance
Apply advanced optimization techniques (e.g., mixture-of-experts, speculative decoding, model parallelism) to improve large model performance
Continuously evaluate recent approaches to hardware acceleration and model execution efficiency
Required Qualifications
3+ years of hands-on experience optimizing GPU-based machine learning workloads
Solid expertise in deep learning frameworks such as PyTorch
Apply on Kit Job: kitjob.in/job/4m8k2n
-
Company name: Google
-
Job position: Distributed Training Pune
Distributed Training Pune has been posted in the Pune Transportation & Logistics category on Locanto.