India

Solution Architect - GPU/TPU Kernel Optimization, Coimbatore

Solution Architect - GPU/TPU Kernel Optimization, Coimbatore
Description
We are seeking an experienced Solution Architect specializing in GPU/TPU Kernel Optimization to design and optimize high-performance kernels for cutting-edge Machine Learning operations. In this role, you will redefine performance boundaries across massive training runs and high-speed inference workloads while shaping the developer infrastructure that powers next-generation AI systems.Responsibilities Design and optimize high-performance kernels (using languages like Pallas, Mosaic and Triton) targeting Tensor Processing Unit (TPU) and Graphics Processing Unit (GPU) architectures for critical Machine Learning (ML) operations, redefining what's possible from massive training runs to high-speed inferenceArchitect infrastructure such as benchmarking suites, autotuning frameworks, performance analysis tools, regression testing and documentation Transform how the developer community interacts with increasingly critical custom kernels in key Open-Source Software (OSS) libraries Track the latest advancements in hardware architectures, compiler technologies and AI models to identify new opportunities for performance optimization through custom kernelsEngage with ML researchers, framework developers (Just After eXecution (JAX), PyTorch) and compiler engineers (Accelerated Linear Algebra (XLA)) to enhance adoption Identify new requirements and address bottlenecks by providing appropriate solutions Requirements 12-18 years of experience in software developmentExpertise in optimizing TPU/GPU code using low-level kernel languages like Pallas, Compute Unified Device Architecture (CUDA) or Triton Knowledge of ML Frameworks (JAX/PyTorch), common operations like attention and Mixture of Experts (MoEs), including model optimization and low-precision formats Understanding of modern accelerators (e.g., data movement, pipelining, heterogeneous compute and scale-out)Understanding of compiler principles (optimization, code generation) and toolchains such as MLIR, OpenXLA Showcase of building developer infrastructure, including Open-Source Software (OSS) libraries, flexible high-performance APIs and easy-to-consume documentation to empower the community Excellent investigative and problem-solving capabilities with communication skills across cross-functional teams
Highlights
Safety Tips
Be careful with commission-based ’work-from-home’ positions that offer an unrealistically high income.
1 / 10
More info about this ad

Solution Architect - GPU/TPU Kernel Optimization has been posted in the Coimbatore Design & Architecture category on Locanto.

Why not check out other ads in this category, such as Transform Your Space with Zika Interior Design – Stylish, Functi, Coimbatore, Structural Steel detailer/PEB detailer/ Draftsman, Coimbatore or Best Interior Designer in Coimbatore in ganapathy, coimbatore. Currently, there are 6 ads posted in the Design & Architecture category in Coimbatore.

You can find the Design & Architecture category under Jobs. Want something else? Check out the related categories Other Jobs, Engineering and Accounting, Financing & Banking Coimbatore.

Interested in more? Widen your search to view ads in nearby areas of Coimbatore. This includes Design & Architecture in Sundapalaiyam, Madukkarai and Peelamedu. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.