Hi Flyte community! 👋
I’m James, Head of Engineering at Elicit (I’ve also led/advised ML & Engineering teams at Outschool, Spring, and Square). Posting here because I’m excited to be hiring Elicit’s first Data Engineer!
Elicit is an AI research assistant built for professional researchers and high-stakes decision makers. Instead of just producing LLM slop in a chat bubble, Elicit helps users break down hard questions, gather evidence from scientific/academic sources, and reason through uncertainty.
We use Flyte and PySpark to orchestrate our ML pipelines that process a couple hundred million academic/research papers: you will initially own and improve this infrastructure. Moving on from there, you’ll figure out how we can scale our data platform to ingest other structured and unstructured documents, spreadsheets and presentations, and rich media like audio and video. We also need your help with ML-adjacent tasks like preparing datasets for fine-tuning our models. Big picture, your work will help accelerate data ingestion, enable secure enterprise integrations, extend Elicit to work over any kind of data corpus, and more.
We’re a small Series A company (about 20 people so far), building AI systems that push the boundaries of how LLMs can be useful. We’ve been thinking about harnessing powerful AI since even before GPT-1 was trained. We need engineers who care about making AI more useful, trustworthy, and aligned with how people actually make tough decisions.
I’ll add more information about Elicit in the thread. If you're interested in applying your Flyte expertise to help build AI systems that accelerate scientific progress, please check out the job description. I’m happy to answer any questions or discuss how we’re utilising Flyte in our stack!
Best,
James