AI Benchmark Engineer (Planning/Operations) (Noida)
Noida, India
Posted: yesterday
- Design and develop multi-agent benchmark tasks involving:
  - Planning, scheduling, and resource allocation
  - Operational decision-making (project management, logistics, incident response, capacity planning)
- Create constraint-rich problem statements with multiple interacting variables
- Develop verification scripts to evaluate:
  - Feasibility (all constraints satisfied)
  - Completeness (all requirements addressed)
  - Optimality (effective solutions)
- Build decomposition strategies:
  - Split tasks across specialized sub-agents (resource-based, constraint-based, conflict resolution, optimization)
- Model real-world operational scenarios with dependencies, timelines, and resource constraints
- Collaborate on improving task quality, coverage, and evaluation rigor

Requirements:
- 5+ years of experience in operations, project management, logistics, supply chain, or AI research, or a strong computer science research background
- Strong ability to formalize constraints, dependencies, and scheduling logic
- Proficiency in Python for building verification and validation scripts
- Strong structured problem-solving and decomposition skills
- Clear and precise technical writing skills
- Experience with AI coding benchmarks (e.g., SWE-bench, Terminal-Bench)
- Hands-on experience with Docker (Dockerfiles, image builds, debugging)

Nice to have:
- Experience with optimization techniques (linear programming, constraint satisfaction, scheduling algorithms)
- Background in operations research
- Experience with simulation or modeling tools
- Knowledge of AI planning systems or automated reasoning
- Project management experience or certifications (PMP, Agile, etc.)

Perks of Freelancing With Turing:
- Work in a fully remote environment.
- Opportunity to work on cutting-edge AI projects with leading LLM companies.

Offer Details:
- Commitment Required: 40 hours per week, with 4 hours of overlap with PST.
- Engagement Type: Contractor assignment (no medical/paid leave)
- Duration of Contract: 4 weeks (adjustable based on engagement)

Apply on Kit Job: kitjob.in/job/4n9hit
Company name: Turing
Job position: AI Benchmark Engineer (Planning/Operations) (Noida)