Research Engineer, Multimodal Reinforcement LearningDeepMind • Zürich, Zürich, Switzerland

Research Engineer, Multimodal Reinforcement Learning

DeepMind • Zürich, Zürich, Switzerland

Vor 4 Tagen

Stellenbeschreibung

Snapshot

Are you a Research Engineer with a passion for Reinforcement Learning and Multimodality Join Google DeepMinds Frontier AI Unit ! We are seeking a researcher to help us make learning efficient through conversational environments. While text-based reasoning has shown immense promise we are moving the frontier toward image-grounded multimodal and retrieval-augmented conversational setups. You will bridge the gap between conversational learning and the visual domain applying the latest RL methods to create scalable semi-verifiable environments that power the next generation of our models (e.g. Gemini).

About us

Google DeepMind : Artificial Intelligence could be one of humanitys most useful inventions. At Google DeepMind were working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery and collaborate with others on critical challenges ensuring safety and ethics are the highest priority.

Frontier AI Unit : The Frontier AI Unit is responsible for building and scaling the next generation of our core models. Within this group our team focuses on conversationality as a mechanism for efficient learning. We believe that learning conversationally transfers between environments. We are moving beyond Chain-of-Thought (CoT) and text-only setups to build multimodal multi-turn reasoning capabilities leveraging an ecosystem of autoraters and autousers to scale environment creation.

The role

We have strong evidence that conversational environments lead to better learning in a transferable way. However we need to go beyond text. As a Research Engineer you will play a pivotal role in expanding Meta Reinforcement Learning to multimodal setups. You will help us leapfrog current industry benchmarks by extending our focus from verifiable domains to semi-verifiable multimodal domains (e.g. Lens Image-grounded reasoning).

This is an ecosystem play : you will leverage our advantages in autoraters and autousers to scale the creation of these conversational environments. You will be the bridge between the core conversational work and the specifics of grounding in the visual domain moving our training infra from static data towards dynamic multi-turn environments.

Key responsibilities

Multimodal RL Research : Design and implement novel RL algorithms that enable multi-turn reasoning and learning in multimodal (text vision) environments.
Environment Scaling : Contribute to the ecosystem of autoraters and autousers building the infrastructure needed to generate high-quality semi-verifiable training environments at scale.
Strategic Application : Apply state-of-the-art methods to solve strategic problems specifically closing the gap between single-turn and multi-turn embeddings (retrieval-augmented reasoning).
Experimentation & Analysis : track interpret and analyze complex experiments providing scientific rigor to our training pipelines.
Collaboration : Act as a connector between teams (Google Research Core GDM GenAI) helping to build shared pipelines for conversational infrastructure that serve product needs in Search Lens and YouTube.

What We Can Offer You

Scientific Contribution : The opportunity to publish and contribute to the scientific community specifically in the high-impact intersection of RL Multimodality and Reasoning.

Scale & Resources : Access to world-class compute and the existing infrastructure of autoraters / autousers allowing you to focus on innovation rather than building from scratch.

Direct Impact : Your work will directly influence the reasoning capabilities of Googles flagship models (Gemini) moving the needle on how models learn and interact with the world.

Collaborative Culture : Work alongside world-leading experts in RL and Generative AI in a supportive growth-oriented environment.

About you

We are looking for a Research Engineer who is not just technically proficient but deeply curious about the mechanics of learning. You should be up to date with the latest methods in RL and eager to apply them to messy ambiguous and high-impact strategic problems. You are comfortable bridging the gap between abstract research and concrete implementation.

Essential Skills :

PhD or Equivalent Experience : A PhD in Computer Science AI or related field or equivalent practical experience with a specific focus on Reinforcement Learning (RL).

Proven Research Track Record : A history of scientific contributions (e.g. publications at NeurIPS ICML ICLR CVPR) or significant contributions to state-of-the-art AI models.

Multimodal Experience : Concrete experience working with multimodal models (vision language) and understanding the specific challenges of grounding text in visual data.

Engineering Excellence : Strong coding skills (Python JAX / TensorFlow / PyTorch) and experience designing and executing complex experiments.

Useful Skills :

Retrieval & Embeddings : Experience with retrieval-augmented generation (RAG) embedding spaces or search infrastructure.

Multi-Agent Systems : Familiarity with self-verification introspection reflection or multi-agent negotiation frameworks.

Infrastructure : Experience building or scaling training environments autoraters or reward models.

At Google DeepMind we value diversity of experience knowledge backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex race religion or belief ethnic or national origin disability age citizenship marital domestic or civil partnership status sexual orientation gender identity pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation please do not hesitate to let us know.

Required Experience :

Key Skills

Robotics,Machine Learning,Python,AI,C / C++,OS Kernels,Research Experience,Matlab,Rust,Research & Development,Natural Language Processing,Tensorflow

Employment Type : Full Time

Experience : years

Vacancy : 1

Jobalert für diese Suche erstellen

Research Engineer Multimodal Reinforcement Learning • Zürich, Zürich, Switzerland

Ähnliche Stellen

Requirements Engineer / Solution Designer - DMS

Coopers Group AG • Zürich, Zurich, Switzerland

Homeoffice

Quick Apply

Für unseren Kunden aus der Bankenbranche in Zürich Süd, suchen wir eine : n erfahrene : n und aufgeschlossene : n.Requirements Engineer / Solution Designer - DMS. Analysieren und Erheben von Anforderungen...Mehr anzeigen

Zuletzt aktualisiert: vor 4 Tagen

Professorship For Evolutionary Anthropology And Primatology

Universität Zürich • Zürich, Zürich, Switzerland

We are seeking candidates in the field of Comparative Evolutionary Anthropology and Primatology for the Irene Staehelin Endowed Professorship at the University of Zurich. This position focuses on th...Mehr anzeigen

Zuletzt aktualisiert: vor 13 Tagen • Gesponsert

ICT Senior System Engineer (m / w / d) 80–100%

Vision-Inside AG • Zug, Switzerland

Begeistere unsere Kundinnen und Kunden : .Kundenzufriedenheit steht bei uns an erster Stelle.Du bist verantwortlich für die erfolgreiche Umsetzung unserer Kundenprojekte. Aufbau, Betrieb und Optimieru...Mehr anzeigen

Zuletzt aktualisiert: vor über 30 Tagen • Gesponsert

German Audio Collection Projects (Remote)

Sigma Group • Zúrich, CH

Homeoffice

Quick Apply

Help shape the future of ethical AI.AI – Shaping the Future of Artificial Intelligence 🌍.Sigma is a leading global technology company specializing in data collection and annotation for Artificial ...Mehr anzeigen

Zuletzt aktualisiert: vor 3 Tagen

Observability & AIOps Engineer / Consultant

Digital Architects Zurich • Zürich, Zurich, Switzerland

Quick Apply

Expertinnen und Experten für Cloud-native und AI-driven Software-Delivery und Operations gesucht! Es warten spannende Projekte bei grossen und kleinen Kunden im Bereich Observability & AIOps s...Mehr anzeigen

Zuletzt aktualisiert: vor 15 Tagen

Product Engineer (100% Remote)

Tether Operations Limited • Zürich, ZH, CH

Homeoffice

Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Mehr anzeigen

Zuletzt aktualisiert: vor 16 Tagen

Director, Search & Evaluation Early TA Lead / Scout (3 positions)

3065 CSL Innovation • Zürich, CH

CSL's R&D organization is accelerating innovation to deliver greater impact for patients.With a project-led structure and a focus on collaboration, we’re building a future-ready team that thriv...Mehr anzeigen

Zuletzt aktualisiert: vor 8 Tagen

Research Engineer - State Estimation and Visual Odometry (ML+classic)

Flexion Robotics • Zürich, ZH, CH

Quick Apply

At Flexion, we're building the intelligence layer powering the next generation of humanoid robots.Our mission is to accelerate the transition from fragile prototypes to real-world humanoid deployme...Mehr anzeigen

Zuletzt aktualisiert: vor über 30 Tagen

Machine Learning Engineer (GCP, Terraform & AI)

PROSTAFF Schweiz GmbH • Zürich, Zurich, Switzerland

Quick Apply

Für ein strategisches Pricing-Programm im Privatkundengeschäft (P&C Pricing, DDC 4.Machine Learning Engineer, der datengetriebene Modelle produktiv einsetzt und echten Business Value schafft.D...Mehr anzeigen

Zuletzt aktualisiert: vor 23 Tagen

Senior AI Inference Engineer (llama.cpp specialist) 100% Remote

Tether Operations Limited • Zürich, ZH, CH

Homeoffice

Zuletzt aktualisiert: vor 17 Tagen

AI Product Experience Manager

cogify ag • Zürich, ZH, CH

Quick Apply

Unser Ziel : Komplexe Prozesse in intuitive, agile und AI-gestützte Lösungen verwandeln.Wir wollen eine Plattform schaffen, die Unternehmen befähigt, Vibe Coding Enterprise tauglich einzusetzen.Du b...Mehr anzeigen

Zuletzt aktualisiert: vor 12 Tagen

IT System Engineer MS Cloud / Azure (m / w) 80-100%

yellowshark • Zug, ZG, Switzerland, CH

Im Auftrag unseres Kunden, ein führender Software-, Cloud- und IT-Service-Dienstleister in der Schweiz, suchen wir zur Verstärkung des IT-Teams eine / n Junior IT System Engineer MS Cloud / MS 365 (m / w...Mehr anzeigen

Zuletzt aktualisiert: vor 17 Tagen • Gesponsert

Sales Engineer MedTech 100% (m / w)

yellowshark • Andelfingen, ZH, Switzerland, CH

Aktive Neukundenakquise und Ausbau des Kundenportfolios mit Fokus auf MedTech-Segmente.Gesamtverantwortung für Kunden und Leads inkl. Umsatz, Verträge, Offerten und Verhandlungen.Weiterentwicklung b...Mehr anzeigen

Zuletzt aktualisiert: vor 9 Tagen • Gesponsert

Internship - 3D Vision and Generative AI

Flexion Robotics • Zürich, ZH, CH

Quick Apply

Zuletzt aktualisiert: vor über 30 Tagen

AI Engineer / Consultant Internship

Artifact • Zürich Oerlikon, ZH, CH

AI brings huge potential, but many companies are still struggling to release the desired impact.We are a pragmatic and dynamic team to help “empower our business clients with AI”.You, as a talent, ...Mehr anzeigen

Zuletzt aktualisiert: vor über 30 Tagen

Produkt Manager Sensorik 100% (m / w)

yellowshark • Zug, ZG, Switzerland, CH

Unser Kunde entwickelt innovative Mess-, Automatisierungs- und datenbasierte Lösungen für Wasser- und Energieversorgung.Strategische Verantwortung für Produkte, Lösungen und Services in der Messtec...Mehr anzeigen

Zuletzt aktualisiert: vor 8 Tagen • Gesponsert

Internship - Humanoid Motion Generation (Diffusion and Flow Matching)

Flexion Robotics • Zürich, ZH, CH

Quick Apply

Zuletzt aktualisiert: vor über 30 Tagen

CAD AI Machine Learning / Deep Learning Engineer – Coding Wizard ( 70% 100% ) (remote / onsite)

Imnoo AG • Zürich, ZH, CH

Homeoffice

Join us for a fulfilling career where you’ll tackle concretely defined, real-world challenges.At Imnoo, you’ll make a significant impact on automating manufacturing processes, gaining invaluable ex...Mehr anzeigen

Zuletzt aktualisiert: vor 15 Stunden • Neu!