Member of Technical Staff - QA Lead
Patronus AI, Inc.
San Francisco, CA 94103Full Time
Job Description
About Patronus AI
Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human-aligned AGI. We are on a mission to simulate all of the world’s intelligence.
We are the team behind some of the earliest and most influential research in AI evaluation like FinanceBench, Lynx, SimpleSafetyTests, CopyrightCatcher, Humanity’s Last Exam, and more. We are formerly AI researchers and engineers from companies like Meta AI, Amazon AGI, and Google. Our customers include foundation model labs and Fortune 500 enterprises like Adobe. We are backed by top-tier investors like Lightspeed Venture Partners, Notable Capital, Stanford University, Noam Brown, Gokul Rajaram, and more.
Responsibilities
You are responsible for ensuring the quality and reliability of our products and systems. You will work closely with our engineering, AI and design teams to identify and resolve any potential issues or bugs, create testing plans and datasets, ensuring that our products meet the highest standards of performance and functionality. You will drive successful execution across a QA engineering team. Successful applicants are skilled, have prior QA and technical lead experience, and are extremely detail-oriented.
In this role, you will:
Lead a team of QA engineers to successfully drive execution across all QA projects.
Build and maintain automated testing across all surface areas: UI, APIs, and MCP
Define and continuously improve QA processes: test planning, criteria, and bug triage workflows.
Partner with engineering and AI research teams to understand requirements early; participate in technical design reviews and provide testability feedback
Establish team best practices and documentation
Qualifications
"The number one qualification to succeed in this machine learning course is gumption” - John Lafferty, CS Professor at Yale
Above all, we look for an eagerness to learn, passion for research, creativity in problem solving and a proactive mindset. You are a great fit if you have a background in the following:
5+ years of QA experience on web platforms in a production environment and building test automation with Python and/or Javascript
2+ years of experience leading QA initiatives on a team
Strong experience with REST APIs and API testing
Strong manual testing skills and the ability to design test cases that uncover edge cases and unexpected behavior
Experience writing and maintaining end-to-end tests (Playwright, Cypress, Selenium)
Experience with CI/CD test integration, test reliability, and debugging flaky tests
Experience executing load/performance tests and interpreting results
Comfort collaborating with AI/ML teams on datasets and evaluation workflows
Good character, integrity, and respect for others!
Benefits
Competitive salary and equity packages
Health, dental, and vision insurance plans
401(k) plan + matching
In-office private chef
Sponsored personal tax accounting
Whoop band, Oura ring, Function Health
Monthly meal stipend
Monthly health and wellness stipend
Equinox membership
Fun global offsites!
Patronus AI is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.
Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human-aligned AGI. We are on a mission to simulate all of the world’s intelligence.
We are the team behind some of the earliest and most influential research in AI evaluation like FinanceBench, Lynx, SimpleSafetyTests, CopyrightCatcher, Humanity’s Last Exam, and more. We are formerly AI researchers and engineers from companies like Meta AI, Amazon AGI, and Google. Our customers include foundation model labs and Fortune 500 enterprises like Adobe. We are backed by top-tier investors like Lightspeed Venture Partners, Notable Capital, Stanford University, Noam Brown, Gokul Rajaram, and more.
Responsibilities
You are responsible for ensuring the quality and reliability of our products and systems. You will work closely with our engineering, AI and design teams to identify and resolve any potential issues or bugs, create testing plans and datasets, ensuring that our products meet the highest standards of performance and functionality. You will drive successful execution across a QA engineering team. Successful applicants are skilled, have prior QA and technical lead experience, and are extremely detail-oriented.
In this role, you will:
Lead a team of QA engineers to successfully drive execution across all QA projects.
Build and maintain automated testing across all surface areas: UI, APIs, and MCP
Define and continuously improve QA processes: test planning, criteria, and bug triage workflows.
Partner with engineering and AI research teams to understand requirements early; participate in technical design reviews and provide testability feedback
Establish team best practices and documentation
Qualifications
"The number one qualification to succeed in this machine learning course is gumption” - John Lafferty, CS Professor at Yale
Above all, we look for an eagerness to learn, passion for research, creativity in problem solving and a proactive mindset. You are a great fit if you have a background in the following:
5+ years of QA experience on web platforms in a production environment and building test automation with Python and/or Javascript
2+ years of experience leading QA initiatives on a team
Strong experience with REST APIs and API testing
Strong manual testing skills and the ability to design test cases that uncover edge cases and unexpected behavior
Experience writing and maintaining end-to-end tests (Playwright, Cypress, Selenium)
Experience with CI/CD test integration, test reliability, and debugging flaky tests
Experience executing load/performance tests and interpreting results
Comfort collaborating with AI/ML teams on datasets and evaluation workflows
Good character, integrity, and respect for others!
Benefits
Competitive salary and equity packages
Health, dental, and vision insurance plans
401(k) plan + matching
In-office private chef
Sponsored personal tax accounting
Whoop band, Oura ring, Function Health
Monthly meal stipend
Monthly health and wellness stipend
Equinox membership
Fun global offsites!
Patronus AI is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.