Distributed Training (Salem)
-
Salem, India
-
Posted: less than a week ago
-
Save
- Enhance performance of distributed training frameworks such as PyTorch, DeepSpeed, or similar systems
- Identify and resolve bottlenecks in large-scale training pipelines (e.g., memory usage, communication overhead, GPU utilization)
- Optimize inference systems using techniques like quantization, caching, and batching to achieve low latency and high throughput
- Collaborate with infrastructure and platform teams to improve resource orchestration, scheduling, and system reliability
- Design benchmarking tools and metrics to measure training efficiency, system throughput, and latency performance
- Apply advanced optimization techniques (e.g., mixture-of-experts, speculative decoding, model parallelism) to improve large model performance
- Continuously evaluate new approaches to hardware acceleration and model execution efficiency
Required Qualifications
- 3+ years of hands-on experience optimizing GPU-based machine learning workloads
- Strong expertise in deep learning frameworks such as PyTorc Apply on Kit Job: kitjob.in/job/4mj29d
-
Company nameGoogle
-
Job positionDistributed Training (Salem)
Distributed Training (Salem) has been posted in the Salem Transportation & Logistics category on Locanto.
Why not check out other ads in this category, such as CDL A Shuttle Truck Driver (R204201), Salem, CDL A Local Delivery Truck Driver (R205841), Salem or CDL A Shuttle Truck Driver - Sysco Portland - Newport (R230925) in 4354 S Coast Hwy 101, Salem. Right now, there are 3 classified ads in Transportation & Logistics in Salem on Locanto.
You can find the Transportation & Logistics category under Jobs. Want something else? Check out the related categories Administrative & Support, Healthcare, Beauty & Wellness and Marketing, Advertising & PR Salem.
There are more ads within a 15 km radius for this category. If you want to view those ads, click here.