Talent.com
AI Engineer - Synthetic Data Generation

Omnilex • Zürich, Zurich, Switzerland
Posted one day ago
Job description

🌟 About You

Do you get joy from turning messy legal texts into clean, structured, high-quality datasets that actually improve model behavior? Do you like building pipelines where every step is measurable: extraction quality, citation correctness, dedup rate, cost per item, throughput, and regression stability? Are you comfortable shipping pragmatic tooling (CLIs, validators, tests) around LLMs without hand-waving away edge cases? If so, we’d love to hear from you.

🚀 About Omnilex

Omnilex is a young, dynamic AI legal tech startup with its roots at ETH Zurich. Our passionate, interdisciplinary team of 10+ people is dedicated to empowering legal professionals in law firms and legal teams by using AI for legal research and for answering complex legal questions. We already stand out by tackling unique challenges, including combining external data, customer-internal data, and our own innovative AI-first legal commentaries.

🧬 Your Mission: Synthetic Data for Legal AI

As an AI Engineer – Synthetic Data Generation, you will build and own pipelines that generate retrieval-ready and evaluation-grade synthetic datasets from real legal sources (court decisions, statutes, commentaries) across languages and jurisdictions, while keeping quality high and costs controlled.

Tasks

🛠️ Your Responsibilities

  • Build multi-step generation pipelines (10+ steps): from DB selection → pseudonymization → extraction → translation → normalization → deduplication → validation → classification → rating → export.
  • LLM integration, production-grade: Design robust prompt suites for extraction, translation, classification, and rating; enforce structured JSON outputs; handle retries, partial failures, and weird model behavior.
  • Quality assurance & filtering: Implement scoring systems (multi-criteria, consistent rubrics), dedup/near-dup suppression, and deterministic validators (especially for citations).
  • Citation processing at legal-grade precision: Extract, normalize, and validate citations across languages and formats (e.g., Art. 336c Abs. 1 OR, BGE 137 III 266 E. 3.2), including abbreviation mapping and normalization rules.
  • Cost & throughput optimization: Use batch APIs where appropriate, tune reasoning effort, control concurrency, count tokens, and keep runs cost-efficient (without sacrificing quality).
  • Developer tooling & CLI workflows: Build CLIs with progress tracking, configurable concurrency, and solid ergonomics for long-running jobs.
  • Testing across levels: Write unit/smoke/integration tests for pipelines and validators (including mocked LLMs where sensible and real API runs where needed).
  • Cross-team collaboration: Work closely with legal experts to define what “good” looks like for exam questions/commentaries, and translate that into measurable QA checks.

Requirements

✅ Minimum qualifications

  • Experience building backend/data tooling with TypeScript/Node.js (strict typing, generics, async patterns).
  • Hands-on experience integrating LLM APIs (OpenAI/Anthropic or similar), including structured outputs (JSON), prompt iteration, and failure handling.
  • Strong data pipeline mindset: ETL workflows, transformation steps, validation, and reproducibility.
  • Solid SQL/PostgreSQL skills and experience with an ORM (bonus if Drizzle).
  • Experience writing reliable tests (e.g., Jest) and maintaining CI-friendly pipelines.
  • Fluent English; willing to work hybrid in Zurich (on-site at least two days/week), full-time.

🎯 Preferred qualifications

  • Familiarity with the Swiss legal system (court structure, citation norms, multilingual legal terminology).
  • Working proficiency in German; French and/or Italian is a strong advantage.
  • Experience with batch processing and cost-aware LLM operations (token budgeting, batching strategy, caching, early-exit).
  • Practical text processing skills: regex-heavy parsing, dedup/near-dup detection, similarity search (e.g., BM25/MiniSearch).
  • Familiarity with our environment: Yarn workspaces/monorepos, NestJS, and pragmatic CLI tooling.

Benefits

🤝 Benefits

  • Direct impact: Your datasets will directly shape model quality and evaluation reliability in legal research and reasoning.
  • Autonomy & ownership: Own the synthetic data pipeline end-to-end: prompts, validators, QA, exports, and cost controls.
  • Team: Work with a sharp interdisciplinary group at the intersection of AI, engineering, and law.
  • Compensation: CHF 7’000–11’000 per month + ESOP, depending on experience and skills.

We’re excited to hear from candidates who love building robust, cost-aware LLM pipelines and care about precision (especially when citations and multilingual legal nuance matter). Apply today by pressing the Apply button.
