Research Scientist, Frontier, Zurich
DeepMind • Zürich, Zürich, Switzerland
13 days ago
Job description

Snapshot

At Google DeepMind, we foster an environment where ambitious, long-term research flourishes. Our team is tackling one of the hardest problems in modern AI: post-training frontier models. Unlike smaller models that can rely on distillation, our frontier models require novel training signals to advance the state of the art. We are defining the horizontal recipes, from revamping RL prompts to advancing Reward Models (RM), that allow these models to think better, reason deeper, and align more closely with human intent. We believe that mastering the feedback loop between user signals and model behavior is the key to breaking through current performance plateaus.

About Us

Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The Role

We are seeking a Research Scientist or Engineer to lead the development of next-generation post-training recipes. In this role, you will move beyond standard tuning; you will architect the Reward Modeling and Reinforcement Learning strategies that define how our most capable models learn. You will focus specifically on hard capabilities, such as improving chain-of-thought reasoning and complex instruction following, where synthetic data and distillation fall short. You will work horizontally to ensure these recipes scale across text, audio, and multimodal domains, establishing the gold standard for how Gemini evolves.

Key responsibilities:

  • Frontier Recipe Development: Design and validate novel post-training pipelines (SFT, RLHF, RLAIF) specifically for frontier-class models where no teacher model exists.
  • Advance Reward Modeling: Lead research into next-gen Reward Models, including investigating new architectures, reducing reward hacking, and improving signal-to-noise ratios in preference data.
  • Unlock Thinking Capabilities: Develop innovative methods to improve the model's internal reasoning (chain-of-thought), focusing on correctness, logic, and self-correction in multi-step tasks.
  • Revamp RL Paradigms: Critically re-evaluate and optimize RL prompts and feedback mechanisms to extract maximum performance from the underlying base models.
  • Solve the Flywheel Challenge: Create robust mechanisms to turn user signals and interactions into training data that continuously improves the model without introducing regression or bias.
  • Horizontal Impact: Collaborate across teams to apply these advanced recipes to various model sizes and modalities (e.g. audio), ensuring consistent, high-quality behavior.

About You

In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:

  • PhD in machine learning, artificial intelligence, or computer science (or equivalent practical experience).
  • Strong background in Large Language Models (LLMs), Reinforcement Learning (RL), or preference learning.
  • Research interest in aligning AI systems with human feedback and utility.
  • Familiarity with experiment design and analyzing large-scale user data.
  • Strong coding and communication skills.

Preferred requirements:

  • Experience with RLHF (Reinforcement Learning from Human Feedback) or DPO (Direct Preference Optimization).
  • Experience building or improving reward models and conducting human evaluation studies.
  • A proven track record of publications in top-tier conferences (e.g. NeurIPS, ICML, ICLR).
  • Experience with Chain-of-Thought (CoT) reasoning research or process-based supervision.
  • Deep understanding of, and experience with, training models from scratch or using self-play / self-improvement techniques.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds, and perspectives, and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy or related condition (including breastfeeding), or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Required Experience: IC

Key Skills: Laboratory Experience, Machine Learning, Python, AI, Bioinformatics, C / C++, R, Biochemistry, Research Experience, Natural Language Processing, Deep Learning, Molecular Biology

Employment Type: Full Time

Experience: years

Vacancy: 1

