
Director of Software Engineering (Node.js & Web Scraping Expert)
Roman Health Pharmacy LLC, Los Angeles, CA, United States
We are seeking a Director of Software Engineering with deep expertise in Node.js development and large‑scale web scraping . This role will lead the engineering team , designing and optimizing high‑performance, distributed web scraping systems . The ideal candidate has extensive experience in handling anti‑bot measures, data pipeline optimization, and scalable cloud‑based architectures .
Key Responsibilities – Software Engineering & Web Scraping Leadership
- Architect, develop, and maintain scalable and distributed web scraping systems using Node.js .
- Design and implement data extraction pipelines to process large volumes of structured and unstructured data.
- Develop solutions to bypass anti‑bot mechanisms , including CAPTCHA handling, session management, fingerprinting, and IP rotation .
- Optimize scraping processes for performance, reliability, and efficiency while managing proxy services (residential, datacenter, rotating). Oversee data storage and processing strategies , ensuring high availability and consistency.
- Collaborate with Product, DevOps, and Data Science teams to integrate extracted data into analytics and business applications.
- Implement best practices for microservices, API integrations, and real‑time data streaming .
Key Responsibilities – Scalability, Security & DevOps
- Lead the transition to cloud‑native, containerized, and serverless architectures for web scraping.
- Ensure compliance with legal and ethical standards (robots.txt, GDPR, CCPA, etc.). Optimize cloud resources (AWS, GCP, or Azure) to support high‑throughput scraping.
- Manage real‑time monitoring and alerting systems to detect scraping failures, IP bans, or performance bottlenecks .
- Work closely with DevOps teams to optimize CI/CD pipelines, automated deployments, and system scalability .
Key Resposnibilities – Engineering Team Management & Strategy
- Lead, mentor, and grow a high‑performance engineering team .
- Define and execute the technology roadmap , aligning with business objectives.
- Foster a culture of continuous learning, collaboration, and innovation .
- Implement agile development methodologies (Scrum, Kanban) to optimize project execution.
- Ensure code quality, security, and best practices across all engineering efforts.
Qualifications & Experience – Technical Expertise
- 10+ years of experience in software engineering , with at least 5+ years in web scraping and large‑scale data extraction .
- Strong hands‑on expertise in Node.js, Puppeteer, Playwright, Cheerio, Selenium, and headless browser automation .
- Extensive experience in handling CAPTCHAs, IP rotation, session management, and anti‑bot evasion techniques .
- Deep knowledge of proxy management (residential, datacenter, rotating, and VPNs) . Experience with NoSQL/SQL databases (MongoDB, PostgreSQL, Redis, Elasticsearch, etc.) .
- Familiarity with data processing frameworks (Kafka, RabbitMQ, Spark, Airflow, etc.) Strong experience with CI/CD, containerization (Docker, Kubernetes), and cloud deployment (AWS/GCP/Azure) .
Qualifications & Experience – Leadership & Soft Skills
- Proven track record of scaling engineering teams and leading complex projects .
- Strong problem‑solving and debugging skills, especially for scraping challenges and performance bottlenecks .
- Excellent communication and stakeholder management skills.
- Passion for mentorship, team development, and continuous learning .
Preferred Qualifications
- Experience with machine learning for data extraction and NLP .
- Knowledge of browser fingerprinting and bot detection mechanisms .
- Familiarity with enterprise‑scale web crawling frameworks (Scrapy, Colly, Apify, etc.).
- Prior leadership experience in data‑driven businesses or web scraping startups .