Talent.com
Research Engineer, Multimodal Reinforcement Learning
Research Engineer, Multimodal Reinforcement LearningDeepMind • Zürich, Zürich, Switzerland
Research Engineer, Multimodal Reinforcement Learning

Research Engineer, Multimodal Reinforcement Learning

DeepMind • Zürich, Zürich, Switzerland
Vor 4 Tagen
Stellenbeschreibung

Snapshot

Are you a Research Engineer with a passion for Reinforcement Learning and Multimodality Join Google DeepMinds Frontier AI Unit ! We are seeking a researcher to help us make learning efficient through conversational environments. While text-based reasoning has shown immense promise we are moving the frontier toward image-grounded multimodal and retrieval-augmented conversational setups. You will bridge the gap between conversational learning and the visual domain applying the latest RL methods to create scalable semi-verifiable environments that power the next generation of our models (e.g. Gemini).

About us

Google DeepMind : Artificial Intelligence could be one of humanitys most useful inventions. At Google DeepMind were working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery and collaborate with others on critical challenges ensuring safety and ethics are the highest priority.

Frontier AI Unit : The Frontier AI Unit is responsible for building and scaling the next generation of our core models. Within this group our team focuses on conversationality as a mechanism for efficient learning. We believe that learning conversationally transfers between environments. We are moving beyond Chain-of-Thought (CoT) and text-only setups to build multimodal multi-turn reasoning capabilities leveraging an ecosystem of autoraters and autousers to scale environment creation.

The role

We have strong evidence that conversational environments lead to better learning in a transferable way. However we need to go beyond text. As a Research Engineer you will play a pivotal role in expanding Meta Reinforcement Learning to multimodal setups. You will help us leapfrog current industry benchmarks by extending our focus from verifiable domains to semi-verifiable multimodal domains (e.g. Lens Image-grounded reasoning).

This is an ecosystem play : you will leverage our advantages in autoraters and autousers to scale the creation of these conversational environments. You will be the bridge between the core conversational work and the specifics of grounding in the visual domain moving our training infra from static data towards dynamic multi-turn environments.

Key responsibilities

  • Multimodal RL Research : Design and implement novel RL algorithms that enable multi-turn reasoning and learning in multimodal (text vision) environments.
  • Environment Scaling : Contribute to the ecosystem of autoraters and autousers building the infrastructure needed to generate high-quality semi-verifiable training environments at scale.
  • Strategic Application : Apply state-of-the-art methods to solve strategic problems specifically closing the gap between single-turn and multi-turn embeddings (retrieval-augmented reasoning).
  • Experimentation & Analysis : track interpret and analyze complex experiments providing scientific rigor to our training pipelines.
  • Collaboration : Act as a connector between teams (Google Research Core GDM GenAI) helping to build shared pipelines for conversational infrastructure that serve product needs in Search Lens and YouTube.

What We Can Offer You

  • Scientific Contribution : The opportunity to publish and contribute to the scientific community specifically in the high-impact intersection of RL Multimodality and Reasoning.
  • Scale & Resources : Access to world-class compute and the existing infrastructure of autoraters / autousers allowing you to focus on innovation rather than building from scratch.
  • Direct Impact : Your work will directly influence the reasoning capabilities of Googles flagship models (Gemini) moving the needle on how models learn and interact with the world.
  • Collaborative Culture : Work alongside world-leading experts in RL and Generative AI in a supportive growth-oriented environment.
  • About you

    We are looking for a Research Engineer who is not just technically proficient but deeply curious about the mechanics of learning. You should be up to date with the latest methods in RL and eager to apply them to messy ambiguous and high-impact strategic problems. You are comfortable bridging the gap between abstract research and concrete implementation.

    Essential Skills :

  • PhD or Equivalent Experience : A PhD in Computer Science AI or related field or equivalent practical experience with a specific focus on Reinforcement Learning (RL).
  • Proven Research Track Record : A history of scientific contributions (e.g. publications at NeurIPS ICML ICLR CVPR) or significant contributions to state-of-the-art AI models.
  • Multimodal Experience : Concrete experience working with multimodal models (vision language) and understanding the specific challenges of grounding text in visual data.
  • Engineering Excellence : Strong coding skills (Python JAX / TensorFlow / PyTorch) and experience designing and executing complex experiments.
  • Useful Skills :

  • Retrieval & Embeddings : Experience with retrieval-augmented generation (RAG) embedding spaces or search infrastructure.
  • Multi-Agent Systems : Familiarity with self-verification introspection reflection or multi-agent negotiation frameworks.
  • Infrastructure : Experience building or scaling training environments autoraters or reward models.
  • At Google DeepMind we value diversity of experience knowledge backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex race religion or belief ethnic or national origin disability age citizenship marital domestic or civil partnership status sexual orientation gender identity pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation please do not hesitate to let us know.

    Required Experience :

    IC

    Key Skills

    Robotics,Machine Learning,Python,AI,C / C++,OS Kernels,Research Experience,Matlab,Rust,Research & Development,Natural Language Processing,Tensorflow

    Employment Type : Full Time

    Experience : years

    Vacancy : 1

    Jobalert für diese Suche erstellen

    Research Engineer Multimodal Reinforcement Learning • Zürich, Zürich, Switzerland

    Ähnliche Stellen
    Requirements Engineer / Solution Designer - DMS

    Requirements Engineer / Solution Designer - DMS

    Coopers Group AG • Zürich, Zurich, Switzerland
    Homeoffice
    Quick Apply
    Für unseren Kunden aus der Bankenbranche in Zürich Süd, suchen wir eine : n erfahrene : n und aufgeschlossene : n.Requirements Engineer / Solution Designer - DMS. Analysieren und Erheben von Anforderungen...Mehr anzeigen
    Zuletzt aktualisiert: vor 4 Tagen
    Professorship For Evolutionary Anthropology And Primatology

    Professorship For Evolutionary Anthropology And Primatology

    Universität Zürich • Zürich, Zürich, Switzerland
    We are seeking candidates in the field of Comparative Evolutionary Anthropology and Primatology for the Irene Staehelin Endowed Professorship at the University of Zurich. This position focuses on th...Mehr anzeigen
    Zuletzt aktualisiert: vor 13 Tagen • Gesponsert
    ICT Senior System Engineer (m / w / d) 80–100%

    ICT Senior System Engineer (m / w / d) 80–100%

    Vision-Inside AG • Zug, Switzerland
    Begeistere unsere Kundinnen und Kunden : .Kundenzufriedenheit steht bei uns an erster Stelle.Du bist verantwortlich für die erfolgreiche Umsetzung unserer Kundenprojekte. Aufbau, Betrieb und Optimieru...Mehr anzeigen
    Zuletzt aktualisiert: vor über 30 Tagen • Gesponsert
    German Audio Collection Projects (Remote)

    German Audio Collection Projects (Remote)

    Sigma Group • Zúrich, CH
    Homeoffice
    Quick Apply
    Help shape the future of ethical AI.AI – Shaping the Future of Artificial Intelligence 🌍.Sigma is a leading global technology company specializing in data collection and annotation for Artificial ...Mehr anzeigen
    Zuletzt aktualisiert: vor 3 Tagen
    Observability & AIOps Engineer / Consultant

    Observability & AIOps Engineer / Consultant

    Digital Architects Zurich • Zürich, Zurich, Switzerland
    Quick Apply
    Expertinnen und Experten für Cloud-native und AI-driven Software-Delivery und Operations gesucht! Es warten spannende Projekte bei grossen und kleinen Kunden im Bereich Observability & AIOps s...Mehr anzeigen
    Zuletzt aktualisiert: vor 15 Tagen
    Product Engineer (100% Remote)

    Product Engineer (100% Remote)

    Tether Operations Limited • Zürich, ZH, CH
    Homeoffice
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Mehr anzeigen
    Zuletzt aktualisiert: vor 16 Tagen
    Director, Search & Evaluation Early TA Lead / Scout (3 positions)

    Director, Search & Evaluation Early TA Lead / Scout (3 positions)

    3065 CSL Innovation • Zürich, CH
    CSL's R&D organization is accelerating innovation to deliver greater impact for patients.With a project-led structure and a focus on collaboration, we’re building a future-ready team that thriv...Mehr anzeigen
    Zuletzt aktualisiert: vor 8 Tagen
    Research Engineer - State Estimation and Visual Odometry (ML+classic)

    Research Engineer - State Estimation and Visual Odometry (ML+classic)

    Flexion Robotics • Zürich, ZH, CH
    Quick Apply
    At Flexion, we're building the intelligence layer powering the next generation of humanoid robots.Our mission is to accelerate the transition from fragile prototypes to real-world humanoid deployme...Mehr anzeigen
    Zuletzt aktualisiert: vor über 30 Tagen
    Machine Learning Engineer (GCP, Terraform & AI)

    Machine Learning Engineer (GCP, Terraform & AI)

    PROSTAFF Schweiz GmbH • Zürich, Zurich, Switzerland
    Quick Apply
    Für ein strategisches Pricing-Programm im Privatkundengeschäft (P&C Pricing, DDC 4.Machine Learning Engineer, der datengetriebene Modelle produktiv einsetzt und echten Business Value schafft.D...Mehr anzeigen
    Zuletzt aktualisiert: vor 23 Tagen
    Senior AI Inference Engineer (llama.cpp specialist) 100% Remote

    Senior AI Inference Engineer (llama.cpp specialist) 100% Remote

    Tether Operations Limited • Zürich, ZH, CH
    Homeoffice
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Mehr anzeigen
    Zuletzt aktualisiert: vor 17 Tagen
    AI Product Experience Manager

    AI Product Experience Manager

    cogify ag • Zürich, ZH, CH
    Quick Apply
    Unser Ziel : Komplexe Prozesse in intuitive, agile und AI-gestützte Lösungen verwandeln.Wir wollen eine Plattform schaffen, die Unternehmen befähigt, Vibe Coding Enterprise tauglich einzusetzen.Du b...Mehr anzeigen
    Zuletzt aktualisiert: vor 12 Tagen
    IT System Engineer MS Cloud / Azure (m / w) 80-100%

    IT System Engineer MS Cloud / Azure (m / w) 80-100%

    yellowshark • Zug, ZG, Switzerland, CH
    Im Auftrag unseres Kunden, ein führender Software-, Cloud- und IT-Service-Dienstleister in der Schweiz, suchen wir zur Verstärkung des IT-Teams eine / n Junior IT System Engineer MS Cloud / MS 365 (m / w...Mehr anzeigen
    Zuletzt aktualisiert: vor 17 Tagen • Gesponsert
    Sales Engineer MedTech 100% (m / w)

    Sales Engineer MedTech 100% (m / w)

    yellowshark • Andelfingen, ZH, Switzerland, CH
    Aktive Neukundenakquise und Ausbau des Kundenportfolios mit Fokus auf MedTech-Segmente.Gesamtverantwortung für Kunden und Leads inkl. Umsatz, Verträge, Offerten und Verhandlungen.Weiterentwicklung b...Mehr anzeigen
    Zuletzt aktualisiert: vor 9 Tagen • Gesponsert
    Internship - 3D Vision and Generative AI

    Internship - 3D Vision and Generative AI

    Flexion Robotics • Zürich, ZH, CH
    Quick Apply
    At Flexion, we're building the intelligence layer powering the next generation of humanoid robots.Our mission is to accelerate the transition from fragile prototypes to real-world deployment of hum...Mehr anzeigen
    Zuletzt aktualisiert: vor über 30 Tagen
    AI Engineer / Consultant Internship

    AI Engineer / Consultant Internship

    Artifact • Zürich Oerlikon, ZH, CH
    AI brings huge potential, but many companies are still struggling to release the desired impact.We are a pragmatic and dynamic team to help “empower our business clients with AI”.You, as a talent, ...Mehr anzeigen
    Zuletzt aktualisiert: vor über 30 Tagen
    Produkt Manager Sensorik 100% (m / w)

    Produkt Manager Sensorik 100% (m / w)

    yellowshark • Zug, ZG, Switzerland, CH
    Unser Kunde entwickelt innovative Mess-, Automatisierungs- und datenbasierte Lösungen für Wasser- und Energieversorgung.Strategische Verantwortung für Produkte, Lösungen und Services in der Messtec...Mehr anzeigen
    Zuletzt aktualisiert: vor 8 Tagen • Gesponsert
    Internship - Humanoid Motion Generation (Diffusion and Flow Matching)

    Internship - Humanoid Motion Generation (Diffusion and Flow Matching)

    Flexion Robotics • Zürich, ZH, CH
    Quick Apply
    At Flexion, we're building the intelligence layer powering the next generation of humanoid robots.Our mission is to accelerate the transition from fragile prototypes to real-world deployment of hum...Mehr anzeigen
    Zuletzt aktualisiert: vor über 30 Tagen
    CAD AI Machine Learning / Deep Learning Engineer – Coding Wizard ( 70% 100% ) (remote / onsite)

    CAD AI Machine Learning / Deep Learning Engineer – Coding Wizard ( 70% 100% ) (remote / onsite)

    Imnoo AG • Zürich, ZH, CH
    Homeoffice
    Join us for a fulfilling career where you’ll tackle concretely defined, real-world challenges.At Imnoo, you’ll make a significant impact on automating manufacturing processes, gaining invaluable ex...Mehr anzeigen
    Zuletzt aktualisiert: vor 15 Stunden • Neu!