We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt-language effects, non-English data processing, and comple
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model optimization. We are seeking legal and compliance professionals to contribute expert judgment to human-in-the-loop AI evaluation
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking software engineering and DevOps professionals to contribute expert judgment to human-in-the-loop AI evaluation
Overview
LILT is building a global network of domain experts to support high-quality AI evaluation across training, benchmarking, red-teaming, and ongoing model monitoring. We are seeking finance and investment professionals to contribute expert judgment to human-in-the-loop AI evaluation
About the Role
We are looking for data-driven Project Managers to lead our large-scale multilingual data collection and Large Language Model (LLM) evaluation. In this role, you will be the operational backbone of our AI development, orchestrating global teams of annotators and data specialists
About LILT
AI is changing how the world communicates, and LILT is leading that transformation. We're on a mission to make the world's information accessible to everyone, regardless of the language they speak. We use cutting-edge AI, machine translation, and human-in-the-loop expertise to tra