AI Research Engineer (Multi-Modal Reinforcement Learning) - …, Bengaluru
-
Bengaluru, India
-
Posted: less than a week ago
-
Save
- Conduct research on reinforcement learning algorithms for multimodal models, including diffusion-based approaches for image autoregressive models for multimodal understanding, and unified frameworks that integrate multiple modalities.
- Design and build reinforcement learning infrastructure that supports scalable, distributed training across multimodal systems while maintaining efficiency and reliability.
- Develop and refine reward modeling strategies that improve training stability, align model behavior with desired outcomes, and mitigate reward hacking and related failure modes.
- Create and curate multimodal simulation environments and datasets to support robust training, evaluation, and benchmarking of reinforcement learning systems.
- Design and conduct rigorous benchmarking and evaluation protocols to measure model performance, track progress against baselines, and validate improvements across multimodal tasks.
- Analyze and optimize policy performance across modalities by identifying bottlenecks in training, credit assignment, and cross-modal alignment.
- Investigate and develop next-generation reinforcement learning paradigms that more effectively learn from workplace feedback, with the goal of achieving superior state-of-the-art (SOTA) performance.
- Publish research findings in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc.
- A Master's degree in Computer Science or a related field is required; a PhD in Machine Learning, NLP, Computer Vision, or a closely related discipline is preferred, along with a strong track record of AI research and publications in top-tier conferences.
- Proven experience running large-scale reinforcement learning experiments in multimodal and vision-centric systems, including online RL settings, with demonstrated impact on domain-specific decision-making and measurable improvements in policy performance.
- Deep understanding of reinforcement learning algorithms and optimization methods applied to vision and multimodal learning problems, with a focus on improving policy stability, exploration, and sample efficiency in complex, high-dimensional environments involving images, video, and other modalities.
- Strong proficiency in PyTorch and deep learning frameworks for vision and multimodal AI, with hands-on experience building end-to-end RL pipelines covering simulation, training, evaluation, and deployment in production-grade systems.
- Demonstrated ability to apply empirical research to solve core RL challenges in multimodal and vision tasks, such as sample inefficiency, exploration-exploitation tradeoffs, and training instability, along with experience designing robust evaluation frameworks and iterating on algorithmic improvements to advance agent performance.
- Proven track record of research publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc. Important information for candidates
Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles:
- Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page:
- Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.
- Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms.
- Double-check email addresses. All communication from us will come from emails ending in @ tether.to or @ tether.io
- We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately. When in doubt, feel free to reach out through our official website. Apply on Kit Job: kitjob.in/job/4mc7rq
-
Company nameTether Operations
-
Job positionAI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide (Bengaluru)
AI Research Engineer (Multi-Modal Reinforcement Learning) - … has been posted in the Kasturba Road Engineering category on Locanto.
If you’re still wanting to browse, there is so much to explore in the Engineering category! Take a look at the ads Challenges Facing AI Robotics Innovation Labs, Bangalore North, Best MEP Design Course Online – Learn HVAC, Electrical & Plumbin, Bengaluru and Constructing and Commercialising Technology best engineering in Bangalore to discover more of what you’re looking for. In total, we have 3 ads in Engineering in Kasturba Road on Locanto classifieds.
Interested in more? Widen your search to view ads in nearby areas of Kasturba Road. This includes Engineering in Adugodi, Ulsoor and Vasanth Nagar. There are more ads within a 15 km radius for this category. If you want to view those ads, click here.