Research Scientist, World Models – Policy Training and Evaluation

Toyota Research Institute
Los Altos, CA Full Time
POSTED ON 8/5/2025 CLOSED ON 9/3/2025

What are the responsibilities and job description for the Research Scientist, World Models – Policy Training and Evaluation position at Toyota Research Institute?

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team in Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavioral Models, and Robotics.


Within the Human Interactive Driving division, the Extreme Performance Intelligent Control department is working to develop scalable, human-like driving intelligence by learning from expert human drivers. This project focuses on creating a configurable, data-driven world model that serves as a foundation for intelligent, multi-agent reasoning in dynamic driving environments. By tightly integrating advances in perception, world modeling, and model-based reinforcement learning, we aim to overcome the limitations of more compartmentalized, rule-based approaches. The end goal is to enable robust, adaptable, and interpretable driving policies that generalize across tasks, sensor modalities, and public road scenarios—delivering transformative improvements for ADAS, autonomous systems, and simulation-driven software development. 


We are looking for a creative and rigorous Research Scientist to focus on tailoring world models for effective use in policy learning and evaluation for autonomous vehicles. In this role, you will be at the heart of research efforts that bridge perception-driven environment models and the training of intelligent decision-making policies. Your work will ensure that learned world models can serve as faithful, controllable, and informative substrates for safe and robust policy optimization and evaluation.


Responsibilities
  • Develop and refine world models that support realistic and diverse counterfactual reasoning, scenario generation, and policy rollout.
  • Ensure that world models are compatible with and useful for reinforcement learning, imitation learning, and offline policy evaluation techniques.
  • Design methods to synthesize high-risk or edge-case scenarios from world models, enabling robust stress-testing of autonomous policies.
  • Explore techniques such as latent-space simulation, world model distillation, differentiable simulation, and closed-loop evaluation to improve policy development and evaluation pipelines.
  • Partner with researchers in world modeling, planning, and safety evaluation to co-develop aligned architectures and learning objectives, so that learned models accurately capture the agent-environment dynamics relevant to long-horizon planning and safety-critical decision-making.
  • Publish high-quality research and contribute to the community through open-source tools, benchmarks, and conference participation.


Qualifications
  • PhD in Computer Science, Robotics, Machine Learning, or a related field.
  • Strong background in at least two of the following areas: world models or model-based reasoning in dynamic environments; world model adaptation and fine-tuning; offline RL or imitation learning; model-based reinforcement learning (MBRL); simulation-to-reality transfer; policy evaluation and safety assurance.
  • A track record of high-quality publications in ML or robotics venues (e.g., ICML, ICLR, NeurIPS, CoRL, RSS).
  • Familiarity with latent dynamics models (e.g., Dreamer, PlaNet, MuZero).
  • Understanding of uncertainty modeling, generalization, and robustness in learned environments.
  • Experience evaluating autonomous vehicle policies in simulation and real-world settings.
  • Experience in building or applying models for downstream evaluation of autonomous systems.
  • Proficiency in Python and ML frameworks (e.g., PyTorch, JAX).



When submitting your CV for this position, please include a brief cover letter and a link to your Google Scholar profile with a full list of publications.


The pay range for this position at commencement of employment is expected to be between $176,000 and $264,000/year for California-based roles; however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. Note that TRI offers a generous benefits package (including 401(k) eligibility and various paid time off benefits, such as vacation, sick time, and parental leave) and an annual cash bonus structure. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.


Please refer to this Candidate Privacy Notice for the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.


TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.


It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

Salary: $176,000 - $264,000


This job has expired.
