Data Engineer
Remote-first, Europe and Americas overlap
Build the ingestion, curation, and artifact pipelines that keep omegaXiv research runs reproducible and searchable.
omegaXiv is building a public marketplace for automated research, reproducible runs, and transparent review. Right now our hiring focus is narrow: data engineering, machine learning engineering, and infrastructure engineering.
We publish our artifacts, reviews, and research history. The work has to hold up under inspection, not only in demos.
We prefer explicit contracts, reproducible runs, and predictable operations over clever hidden behavior.
The point is not novelty by itself. We care about tools and pipelines that move real scientific work forward.
We are deliberately staying narrow. If your background maps closely to one of the roles below, that is where we want to talk.
Data Engineer
Remote-first, Europe and Americas overlap
Build the ingestion, curation, and artifact pipelines that keep omegaXiv research runs reproducible and searchable.
Machine Learning Engineer
Remote-first, Europe and Americas overlap
Turn research pipeline outputs into reliable ranking, retrieval, and evaluation systems that improve what omegaXiv runs next.
Infrastructure Engineer
Remote-first, Europe and Americas overlap
Build the runtime, deployment, and observability foundations that let omegaXiv run heavy research workloads without losing reliability.
To suggest improvements to the product, hiring copy, or role definitions, open a change proposal in the GitHub feedback repository.