Ai Benchmark Engineer – Mathematical Reasoning Kollam
Ai Benchmark Engineer – Mathematical Reasoning Kollam
-
Kollam, India
-
Posted: a week ago
-
Save
Description
Role Overview
We are seeking a highly analytical and computationally proficient individual to join our team with a solid research background. You will be instrumental in contributing to this role by either crafting challenging and insightful problems in your respective research domain or devising elegant computational solutions.
Responsibilities:
Build multi-agent benchmark tasks that require multi-step mathematical reasoning, proof construction, or algorithmic problem-solving
Design problems that are genuinely hard for a single agent but decomposable — competition math, numerical analysis, combinatorial optimisation, statistical inference
Create verification scripts that check mathematical correctness — numerical answers with appropriate tolerance, proof step validity, and algorithm output correctness
Write clear problem statements with exact notation, definitions, and output format
Create decomposition guides that split problems into independent sub-computations or parallel solution strategies
Offer Details
Pay: INR 1.75 to 2 Lakhs per month
Mode of work: Fully Remote
Duration: 12 months (likely extended)
Number of positions: 15
Required Qualifications:
5+ years in mathematics, quantitative research, or computational science — competition math, university-level mathematics, or quantitative research background. Python programming — NumPy, SciPy, or symbolic computation (SymPy). Experience writing mathematical proofs or formal derivations.
Ability to create problems with exact, verifiable answers — not subjective or open-ended.
Experience with AI coding benchmarks (SWE-bench, Terminal-bench). Comfortable with Docker — writing Dockerfiles, building images, and debugging container issues.
Understanding of numerical methods — floating-point tolerance, convergence criteria, and error bounds.
Solid plus:
Experience creating math competition problems (AMC, AIME, Putnam, IMO, or similar).
Research in mathematics, theoretical CS, or q Apply on Kit Job: kitjob.in/job/4lrxoq
We are seeking a highly analytical and computationally proficient individual to join our team with a solid research background. You will be instrumental in contributing to this role by either crafting challenging and insightful problems in your respective research domain or devising elegant computational solutions.
Responsibilities:
Build multi-agent benchmark tasks that require multi-step mathematical reasoning, proof construction, or algorithmic problem-solving
Design problems that are genuinely hard for a single agent but decomposable — competition math, numerical analysis, combinatorial optimisation, statistical inference
Create verification scripts that check mathematical correctness — numerical answers with appropriate tolerance, proof step validity, and algorithm output correctness
Write clear problem statements with exact notation, definitions, and output format
Create decomposition guides that split problems into independent sub-computations or parallel solution strategies
Offer Details
Pay: INR 1.75 to 2 Lakhs per month
Mode of work: Fully Remote
Duration: 12 months (likely extended)
Number of positions: 15
Required Qualifications:
5+ years in mathematics, quantitative research, or computational science — competition math, university-level mathematics, or quantitative research background. Python programming — NumPy, SciPy, or symbolic computation (SymPy). Experience writing mathematical proofs or formal derivations.
Ability to create problems with exact, verifiable answers — not subjective or open-ended.
Experience with AI coding benchmarks (SWE-bench, Terminal-bench). Comfortable with Docker — writing Dockerfiles, building images, and debugging container issues.
Understanding of numerical methods — floating-point tolerance, convergence criteria, and error bounds.
Solid plus:
Experience creating math competition problems (AMC, AIME, Putnam, IMO, or similar).
Research in mathematics, theoretical CS, or q Apply on Kit Job: kitjob.in/job/4lrxoq
Highlights
-
Company nameMillionlogics
-
Job positionAi Benchmark Engineer – Mathematical Reasoning Kollam
Safety Tips
Be careful with jobs that explicitly state ’no experience needed’.
More info about this ad
Ai Benchmark Engineer – Mathematical Reasoning Kollam has been posted in the Quilon Engineering category on Locanto.
Right now, this is the only ad posted in this category in Quilon.
There are more ads within a 15 km radius for this category. If you want to view those ads, click here.