
Software Engineer, Model Behavior, OpenAI (San Francisco)



Job description

About the Team

The Model Behavior team shapes how our models interact with people, aiming to create intuitive experiences that exceed user expectations and feel like magic.

The team partners closely with research and product teams across the company to improve the real-world usefulness of our models at scale.

Our work directly impacts hundreds of millions of users globally and contributes to OpenAI's mission of broadly distributing safe AI.

About the Role

We are looking for a full-stack engineer with experience in observability, tooling, and data pipelines to capture, aggregate, and surface production signals about human–model interactions.

You will build systems to understand and act on how users engage with our models and where our models fall short, and develop robust evaluations to define and track improvements in model behavior.

These signals power our understanding of real-world model performance and can be captured in creative ways beyond standard logging and metrics, requiring both technical skill and product intuition.

This role requires working across the stack, from building front-ends that surface and visualize insights to debugging back-end pipelines, and collaborating with cross-functional teams to ship iteratively under tight deadlines.

You should thrive in scrappy environments, quickly prototype solutions, and care deeply about end-user experience and aesthetics.


This role is based in San Francisco, CA.

We use a hybrid work model of three days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Do the highest-leverage work to improve models for users at scale

  • Build systems to understand and act on how users engage with our models and where our models fall short of delivering the best experiences for our users

  • Develop robust evaluations to define and track improvements in model behavior

  • Rapidly prototype and develop tooling, dashboards, and visualizations for researchers and applied teams

  • Design, implement, test, and debug code across the research and product stack

  • Build and maintain robust telemetry, logging, and data pipelines to support production-scale model evaluation

  • Collaborate across research, safety, infrastructure, and product teams to deliver solutions that improve model efficiency and user experience

  • Own and support experiments that validate hypotheses around model behavior

You might thrive in this role if you:

  • Have experience building and maintaining full-stack observability tooling

  • Have built evaluations for capability and model improvements

  • Enjoy owning 0→1 user-facing products or tools, ideally in a startup or fast-moving environment

  • Ship quickly under competing priorities and tight deadlines

  • Understand how evaluations work and are curious about model training and iteration

  • Care about product polish, usability, and interface aesthetics

  • Collaborate effectively across teams and take on diverse tasks to move work forward

  • Are a team player, willing to do a variety of tasks that move the team forward

  • Bonus: understand AI/ML workloads and have experience building evaluation systems for them

