NeurIPS 2025 Career Opportunities
Here we highlight career opportunities submitted by our Exhibitors, and other top industry, academic, and non-profit leaders. We would like to thank each of our exhibitors for supporting NeurIPS 2025.
Search Opportunities
Hong Kong
Flow Traders is looking for a Senior Research Engineer to join our Hong Kong office. This is a unique opportunity to join a leading proprietary trading firm with an entrepreneurial and innovative culture at the heart of its business. We value quick-witted, creative minds and challenge them to make full use of their capacities.
As a Senior Research Engineer, you will be responsible for helping to lead the development of our trading model research framework and using it to conduct research to develop models for trading in production. You'll expand the framework to become global standard way of training, consuming, combining, and transforming any data source in a data-driven systematic way. You will then partner with Quantitative Researchers to build the trading models themselves.
What You Will Do
- Help to lead the development and global rollout of our research framework for defining and training models through various optimization procedures (supervised learning, backtesting etc.), as well as its integration with our platform for deploying and running those models in production
- Partner with Quantitative Researchers to conduct research: test hypotheses and tune/develop data-driven systematic trading strategies and alpha signals
What You Need to Succeed
- Advanced degree (Master's or PhD) in Machine Learning, Statistics, Physics, Computer Science or similar
- 8+ years of hands-on experience MLOps, Research Engineering, or ML Research
- A strong background in mathematics and statistics
- Strong proficiency in programming languages such as Python, with experience in libraries like numpy, pytorch, polars, pandas, and ray.
- Demonstrated experience in designing and implementing end-to-end machine learning pipelines, including data preprocessing, model training, deployment, and monitoring
- Understanding of and experience with modern software development practices and tools (e.g. Agile, version control, automated testing, CI/CD, observability)
- Understanding of cloud platforms (e. g., AWS, Azure, GCP) and containerization technologies (e. g., Docker, Kubernetes)
The role We seek an experienced Senior ML Solutions Architect to support customers leveraging Nebius Token Factory's serverless inference platform for open-source LLMs across multiple modalities. In this role, you will be collaborating with clients to design and implement customized LLM-based solution and architect scalable AI applications using our served models, and working together with our backend team to improve our platform to match the clients' needs.
You’re welcome to work remotely from the United States or Canada.
Your responsibilities will include: - Design and implement LLM-based solutions using Nebius Token Factory’s inference services to drive business value and support customer goals. - Build production-ready applications leveraging our serverless LLM APIs, including multimodal models (text, vision, audio) and domain-specific models. - Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization. - Collaborate with product and engineering teams to surface customer feedback and shape the platform roadmap. - Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.
We expect you to have: - 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI. - Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
Hands-on experience with: - Prompt engineering and LLM pipeline development, including evaluation. - Agentic frameworks such as Langchain, Langsmith, smolagents, or equivalent. - Vector databases and RAG implementation patterns. - Deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models. - Strong Python programming skills. - Excellent communication skills, with the ability to clearly explain technical concepts to diverse audiences.
It will be an added bonus if you have: - Experience with inference frameworks and libraries (e.g., vLLM, SGLang, TensorRT-LLM, Transformers). - Familiarity with inference optimization techniques such as quantization, batching, caching, and routing. - Work with multimodal AI models (e.g., vision-language, speech). - Proficiency with DevOps tools (Docker, Kubernetes). - Contributions to open-source ML/AI projects.
Preferred tooling: - Programming Languages – Python - ML Frameworks and Libraries– vLLM, SGLang, TensorRT-LLM, Transformers, OpenAI/Anthropic SDKs - Frameworks for Agentic Pipelines : Langchain / Langsmith / smolagents / equivalent - API and Web Frameworks– FastAPI, Flask - MLOps and DevOps tools– Kubernetes (K8s), Docker, Git - Cloud Platforms– AWS (SageMaker, Bedrock), GCP (Vertex AI), Azure (Azure ML)
NVIDIA is developing the NVIDIA DRIVE AV Solution (NDAS), powered by the latest advancement in AI and accelerated computing. We are seeking a highly motivated software expert to join our Autonomous Vehicles (AV) Drive-Alpha team in US Santa Clara. You will be driving the engineering execution of feature development or exceeding the meaningful metric requirements, especially for L2++ and L3/L4.
Drive-Alpha consists of proficient domain-experts spanning the full stack of autonomous driving, including perception, fusion, prediction, planning and control, autonomous model, many with proven development experiences for the highly competitive market in key functions like Highway NOA (Navigation on Autopilot) and Urban NOA (Navigation on Autopilot), as well as p2p driving (including parking). This team is responsible for the integration and sign-off of NDAS component teams' merge request/change lists, promotes validated changes to merge into stable branch, analyzes the root-cause of the regression identified, and drive the corrective actions taken by component engineering teams for a productive CI/CD process of SW development. Also, the team members tightly integrate into component teams' development, acting as a dependency resolver for the component teams to deliver cross-function improvement that are most impactful to NDAS product. We nurture teamwork among component teams' engineers and establish positive relationships and communications with partner organizations.
What you’ll be doing: Provide in-depth and insightful technical feedback on the quality of NDAS L2++/L3/L4 SW stack, based on the performance metrics proven through offline-replay and in-car testing. Identify the weak link of the L2++/L3/L4 SW stack and make it strong. Integrate, test and sign-off SW stack's code change and model update, and drive the Root-Cause-Corrective-Action (RCCA) process to continuously improve the quality of NDAS SW. Decompose a complicated cross-function problem into actionable items and coordinate a concerted effort among multiple collaborators. Join force with component team developers, when necessary, provide your domain expert input to a solution for hard problems, and produce production-quality code to component code base.
What we need to see: BS/MS in Electrical Engineering, Computer Science, or related fields or equivalent experience. 5+ years related experience in software development, with hands-on dev experience in AD for automotive. Great coding skills in modern C++ and scripting languages like Python. Deep understanding of L2++/L3/L4 product features in the market. Hands-on experience in debugging AD SW problems. Excellent communication and interpersonal skills with ability to strive in a cross-disciplinary environment.
Ways To Stand Out From The Crowd: Experience of working as a hands-on tech lead for one or more autonomous driving components. Hands-on development experience of an SOP-ed AD and/or ADAS product. Rich experience of in-car testing with great intuition of first-level triaging (from symptom to component). Familiar with CI/CD process, test automation, Jenkins, Log-Sim reply.
NVIDIA has some of the most forward-thinking and hardworking people in the world. If you're creative and autonomous, we want to hear from you! Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 212,750 USD.
You will also be eligible for equity and benefits.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other char
We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA is searching for an outstanding research scientist to build humanoid robot foundation models and systems in the Generalist Embodied Agent Research (GEAR) group. Everything that moves will eventually be autonomous. Our mission is to build general-purpose embodied agents that learn to explore and master complex skills across the virtual and the physical world.
You will work with an amazing and collaborative research team that consistently produces influential works on multimodal foundation models, large-scale robot learning, game AI, and physical simulation. Our past projects include Eureka, VIMA, Voyager, MineDojo, MimicPlay, Prismer, and more. One of our team’s most recent milestones includes Project GR00T, a foundation model for humanoid robots. Your contributions will have a significant impact on our moonshot research projects and product roadmaps.
What you will be doing: Design and implement novel AI algorithms and models for general-purpose humanoid robots and embodied agents;
Develop large-scale AI training and inference methods for foundation models;
Optimize and deploy AI models in physical simulation and on robot hardware;
Collaborate with research and engineering teams across all of NVIDIA to transfer research to products and services.
What we need to see: A Ph.D. in Computer Science/Engineering, Electrical Engineering, etc., or equivalent research experience.
5 years of relevant work/research experience across one or both of these fields: Multimodal Foundation Models
Hands-on training experience and publications in at least one of the following topics: LLMs; Large vision-language models; Video generative models and diffusion algorithms; or Action-based transformers.
Outstanding engineering skills in rapid prototyping and model training frameworks (PyTorch, Jax, Tensorflow, etc.). Python is required; C++ and CUDA proficiencies are a big plus;
Excellent skills in working with large-scale machine learning/AI systems and compute infrastructure.
Robotics: Hands-on training experience and publications in robot learning, such as reinforcement learning, imitation learning, classical control methods, etc.
Strong programming skills in Python, C++, ROS, and machine learning frameworks like PyTorch.
Deep understanding of robot kinematics, dynamics, and sensors;
Ability to safely operate robot hardware, lab equipment, and tools;
Knowledge of control methods, including PID, model predictive control, and whole-body control;
Familiarity with physics simulation frameworks such as MuJoCo and Isaac Sim;
Robot hardware design and hands-on building experience.
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and productive people in the world. Please join us and be part of the forefront of developing general-purpose robots and embodied agents!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 299,000 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Palo Alto, CA
Position Description: Tesla’s AI team is pushing the frontier of real-world machine learning, building models that reason, predict, and act with human-level physical intelligence. We train and deploy large-scale ML systems powering products from Autopilot to Optimus.
As part of the Model Optimization group, you will work at the intersection of machine learning and systems, designing our most advanced models to run efficiently across Tesla’s diverse compute stack, from data centers to edge AI accelerators. You will design the model architecture and engineer algorithmic optimizations that make large-scale model inference fast, reliable, and hardware-aware.
Responsibilities: Design, train, and deploy large neural networks that run efficiently on heterogeneous hardware (GPU, CPU, Tesla’s in-house AI ASIC) Develop and integrate quantization, sparsity, pruning, and distillation techniques to improve inference performance Design inference algorithms that improve inference performance in terms of quantization and latency Profile and improve latency, throughput, and memory efficiency for large ML models across edge and cloud environments Collaborate with compiler and hardware engineers to co-design architectures for efficient real-time inference Design and implement custom GPU kernels (CUDA / OpenCL) to accelerate model operations and post-processing pipelines Conduct systematic benchmarking, scaling, and validation of inference performance across Tesla platforms
Requirements: Proven experience in scaling and optimizing inference for large ML models, particularly transformers or similar architectures Familiarity with quantization-aware training, model compression, and distillation for edge and real-time inference Proficiency with Python and C++ (modern standards 14/17/20) and deep learning frameworks such as PyTorch, TensorFlow, or JAX Strong understanding of computer systems and architecture, with experience deploying ML models on GPUs, TPUs, or NPUs Hands-on expertise with CUDA programming, low-level performance profiling, and compiler-level optimization (TensorRT, TVM, XLA) Experience collaborating with compiler/hardware engineers to bridge model and system-level optimization Excellent problem-solving skills and the ability to debug and tune high-performance inference workloads
Redwood City, CA
Biohub is leading the new era of AI-powered biology to cure or prevent disease through its 501c3 medical research organization, with the support of the Chan Zuckerberg Initiative.
The Team Biohub supports the science and technology that will make it possible to help scientists cure, prevent, or manage all diseases by the end of this century. While this may seem like an audacious goal, in the last 100 years, biomedical science has made tremendous strides in understanding biological systems, advancing human health, and treating disease.
Achieving our mission will only be possible if scientists are able to better understand human biology. To that end, we have identified four grand challenges that will unlock the mysteries of the cell and how cells interact within systems — paving the way for new discoveries that will change medicine in the decades that follow:
Building an AI-based virtual cell model to predict and understand cellular behavior
Developing novel imaging technologies to map, measure and model complex biological systems
Creating new tools for sensing and directly measuring inflammation within tissues in real time.tissues to better understand inflammation, a key driver of many diseases
Harnessing the immune system for early detection, prevention, and treatment of disease
The Opportunity
At Biohub, we are generating unprecedented scientific datasets that drive biological modeling innovation:
Billions of standardized cells of single-cell transcriptomic data, with a focus on measuring genetic and environmental perturbations 10s of thousands of donor-matched DNA & RNA samples PB-scale static and dynamic imaging datasets TB-scale mass spectrometry datasets Diverse, large multi-modal biological datasets that enable biological bridges across measurement types and facilitate multi-modal model training to define how cells act. After model training, we make all data products available through public resources like CELLxGENE Discover and the CryoET Portal, used by tens of thousands of scientists monthly to advance understanding of genetic variants, disease risk, drug toxicities, and therapeutic discovery.
As a Senior Staff Data Scientist, you'll lead the creation of groundbreaking imaging datasets that decode cellular function at the molecular level, describe development, and predict responses to genetic or environmental changes. Working at the intersection of data science, biology, and AI, you'll define data needs, format standards, analysis approaches, quality metrics, and our technical strategy, creating systems to ingest, transform, and validate and deploy data products.
Success for this role means delivering high-quality, usable datasets that directly address modeling challenges and accelerate scientific progress. Join us in building the data foundation that will transform our understanding of human biology and move us along the path to curing, preventing, and managing all disease.
New York
Quantitative Analyst Ph.D. Intern (New York) – Summer 2026
The D. E. Shaw group seeks talented Ph.D. candidates with impressive records of academic and/or professional achievement to join the firm as quantitative analyst interns. Ph.D. interns explore how the analytical skills gained from their graduate programs may relate to the work done at the firm while interacting with fellow interns and employees of similar academic backgrounds in a collegial working environment. This 12-week program will take place in New York and is expected to run from June to August 2026.
What you'll do day-to-day
You’ll spend the summer working on a research project that typically involves exploring a variety of statistical modeling techniques and writing software to analyze financial data. You’ll have a dedicated mentor in one of our quantitative research groups and are encouraged to attend our academic speaker series and track academic progress in various areas that may be of interest.
Who we're looking for
- Individuals with impressive records of academic achievement, including advanced coursework in fields such as math, statistics, physics, engineering, computer science, or other technical and quantitative programs.
- Applicants should have notable research productivity in their respective areas of study as well as a track record of creativity in their field(s).
- Interest or experience working in a data-driven research environment, including manipulation of data using high-level programming languages such as Python, is preferred.
- An exceptional aptitude for abstract reasoning, problem solving, and quantitative thinking, in addition to prior probability or statistics knowledge, is a plus.
- No previous finance experience is necessary, though candidates should have an interest in learning about quantitative finance.
- Students who apply to this internship are usually approaching their final year of full-time study.
- The position offers a monthly base salary of 25,000USD, overtime pay, a sign-on bonus of 25,000USD, travel coverage to and from the internship, and choice of furnished summer housing or a 10,000USD housing allowance. It also includes a 3,300USD stipend for self-study materials and a 4,000USD stipend for personal technology equipment. If you have any questions about the compensation, please ask one of our recruiters.
Successful hires will expand the group's efforts applying machine learning to drug discovery, biomolecular simulation, and biophysics. Areas of focus include generative models to help identify novel molecules for drug discovery targets, predict PK and ADME properties of small molecules, develop more accurate approaches for molecular simulations, and understand disease mechanisms. Ideal candidates will have strong Python programming skills. Relevant areas of experience might include deep learning techniques, systems software, high performance computation, numerical algorithms, data science, cheminformatics, medicinal chemistry, structural biology, molecular physics, and/or quantum chemistry, but specific knowledge of any of these areas is less critical than intellectual curiosity, versatility, and a track record of achievement and innovation in the field of machine learning. For more information, visit www.DEShawResearch.com.
Please apply using this link: https://apply.deshawresearch.com/careers/Register?pipelineId=597&source=NeurIPS_1
The expected annual base salary for this position is USD 300,000 - USD 800,000. Our compensation package also includes variable compensation in the form of sign-on and year-end bonuses, and generous benefits, including relocation and immigration assistance. The applicable annual base salary paid to a successful applicant will be determined based on multiple factors including the nature and extent of prior experience and educational background. We follow a hybrid work schedule, in which employees work from the office on Tuesday through Thursday, and have the option of working from home on Monday and Friday.
D. E. Shaw Research, LLC is an equal opportunity employer.
Chicago / Austin
As a Quantitative Research Intern, you will work side-by-side with our Research Team of mathematicians, scientists and technologists, to develop and enhance the models that drive Optiver’s trading. You will tackle a practical research project that has real-world impact and directly influences Optiver’s trading decisions. In our business, where the markets are always evolving, you will use your skills to predict its movements.
What you’ll do: Led by our in-house education team, you will delve into trading fundamentals and engage in a research project that makes a real-world impact. You will be paired with one of Optiver’s seasoned researchers, providing you exposure to a variety of research areas, including: • Using statistical models and machine learning to develop trading algorithms. • Leveraging big data technologies to analyze high-frequency trading strategies, market microstructure, and financial instruments to identify trading opportunities. • Building stochastic models to determine the fair value of financial derivatives. • Combining quantitative analysis and high-performance implementation to ensure the efficiency and accuracy of pricing engines and libraries.
What you’ll get: You’ll join a culture of collaboration and excellence, surrounded by curious thinkers and creative problem-solvers. Motivated by a passion for continuous improvement, you’ll thrive in a supportive, high-performing environment alongside talented colleagues, collectively tackling some of the toughest challenges in the financial markets.
In addition, you’ll receive: • The opportunity to work alongside best-in-class professionals from over 40 different countries • The opportunity to earn a return internship or full-time offer in Chicago, Austin, New York City, or Amsterdam based on performance • A highly-competitive internship compensation package • Optiver-covered flights, living accommodations, and commuting stipends • Extensive office perks, including breakfast, lunch and snacks, regular social events, clubs, sporting leagues and more
Who you are: • Currently enrolled in a PhD program in Mathematics, Statistics, Computer Science, Physics or a related STEM field with outstanding academic performance • Expected graduation between December 2026 and June 2028 • Available to intern during Summer 2026 • Open to full-time opportunities upon graduation in 2027 or 2028 • Solid foundation in mathematics, probability, and statistics • Excellent research, analytical, and modeling skills • Independent research experience • Proficiency in any programming language • Experience in machine learning, with practical applications in time-series analysis and pattern recognition • Strong interest in working in a fast-paced, collaborative environment • Fluent in English with strong written and verbal communication skills
Who we are: At Optiver, our mission is to improve the market by injecting liquidity, providing accurate pricing, increasing transparency and stabilising the market no matter the conditions. With a focus on continuous improvement, we prioritise safeguarding the health and efficiency of the markets for all participants. As one of the largest market making institutions, we are a respected partner on 100+ exchanges across the globe. Our differences are our edge. Optiver does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, physical or mental disability, or other legally protected characteristics.
Optiver is supportive of US immigration sponsorship for this role.
*We accept one application per role per year. If you have previously applied to this position during this season and have been unsuccessful, you can reapply once the next recruitment season begins in 2026.
Job Description At Emmi AI, we are redefining how industries innovate. Traditional simulations are slow, expensive, and computationally heavy. We make them fast, scalable, and intelligent! Our AI-powered physics architecture and models unlock real-time interaction, slashing simulation times from days to seconds.
The Opportunity Work directly with our founding team as the strategic right hand to our CEO, COO, and Chief Scientist. You'll help navigate Emmi AI's growth trajectory in the specialized AI-physics simulation landscape through a blend of strategic insight and hands-on execution.
Your Mission - Strategic Intelligence & Decision-Making – Conduct market analysis of the specialized physics AI landscape, translating technical developments into strategic positioning - Lead Cross-Functional Projects – Orchestrate a wide range of high-impact special projects - VC & Investor Relations – Lead preparation for funding rounds, create compelling investor materials, and manage relationships with our European and international investors - High-Stakes Execution – Drive critical initiatives from conception to completion, adapting quickly as our strategic landscape evolves
Job Requirements - Venture Experience – 5+ years in VC, deep tech startups, consulting, or similar analytical roles - Strategic Thinking – Proven ability to analyze complex markets and translate insights into actionable recommendations - Project Management – Project management expertise across multiple workstreams - International Perspective – Experience working across European markets and innovation ecosystems - Exceptional Communication – Ability to translate complex technical concepts for diverse stakeholders
What Sets You Apart - Analytical Depth – You excel at quantitative and qualitative analysis, uncovering insights others miss - Technical Curiosity – While not necessarily a developer, you understand technology well enough to engage meaningfully with our research team - Growth Mindset – You thrive in ambiguity and see challenges as opportunities to push boundaries - Continental Vision – You understand Europe's unique deep tech ecosystem and can help position Emmi for success across multiple markets - Execution Excellence – You deliver consistently high-quality work and can adapt quickly as priorities evolve