Help keep our datasets trustworthy at web scale. You’ll write SQL to profile historical data, build Python checks for anomalies and regressions, define data quality KPIs, and surface results via lightweight reports. You’ll partner with Data Eng to validate ETL, document data contracts, and triage issues, all using version-controlled QA assets and repeatable workflows. You’re a BS/MS student with solid SQL + Python, Git fluency, and a systematic, detail-oriented mindset.

We are a technology product startup that is transforming the job market and career personalization space. Our advanced data mining, computing optimizations, and state-of-the-art Natural Language Processing (NLP) methods, including transformer- and LLM-based architectures, enable large-scale processing of job market data*. We help job seekers find the jobs that best fit their skills and experience, and identify the skill and credential gaps standing between them and their dream jobs.
*Our database is already more than 100 times the size of the entire Wikipedia corpus, and we aim to scale our data by a further 10 to 100 times this year.
📍 Location: Remote