Location: San Francisco, CA   |   Full-Time   |   $100,000 - $150,000
Python, Linux, PostgreSQL, Dagster, SQL, Unix, Web Crawling, Data Pipeline, Data Cleaning, Docker, Kubernetes, Full Stack, Mobile Development, Statistics, Back End Engineer, Data Engineer
Company: Spice Data (YC S19) licenses data to leading Fortune 500 restaurants. Our work centers on sourcing (web crawling), cleaning, and formatting large datasets (150M+ data points/month). We're a small, profitable, SF-based team founded in 2019.

Role: Join our small and nimble engineering team in downtown San Francisco (on-site required a few times per week).

Responsibilities:
- Create and manage data collection scripts using HTTP requests, browser automation, and mobile app automation.
- Build and maintain automation/scheduling tooling (e.g., Dagster) for timely data collection.
- Create data cleaning and normalization scripts, with potential for ML/LLM integration.
- Design data analytics dashboards and tooling for monitoring data quality.
- Assist with miscellaneous DevOps tasks for managing infrastructure.

Ideal Candidate:
- Excited to work on varied projects, sometimes concurrently.
- Able to execute with minimal supervision.
- Experience building/maintaining data pipelines or working on web crawling/scraping.
- Familiarity with Unix-like systems is essential (terminal-based tooling).
- New grads are welcome.

Required Skills:
- Python
- SQL
- Unix

Bonus Skills:
- Web Crawling
- Docker
- Kubernetes
- Full Stack Web Development
- Mobile App Development
- Background in Statistics

Tech Stack: Python, Linux, PostgreSQL, Dagster

Benefits:
- Lunch provided when in office
- Unlimited PTO
- 401k
- Company-paid Platinum PPO health insurance and comparable dental & vision insurance
- Competitive equity (0.25% - 1.00%)
Post Date: April 23, 2025