Lima, Peru
Mid Level / Senior
Responsibilities
- Develop, test, and deploy web scraping scripts and crawlers using Python tools.
- Architect and maintain asynchronous scraping systems for large-scale data extraction.
- Implement and optimize anti-blocking strategies and proxy rotation.
- Manage data ingestion pipelines and integrate with external REST APIs.
- Debug and improve scraper performance and data quality.
- Collaborate with engineers to enhance scraping infrastructure and monitoring systems.
- Assist with DevOps tasks, including Docker containerization and CI/CD pipelines.
Requirements
- Proven experience in high-volume web scraping and data extraction using Python.
- Solid understanding of HTML parsing and browser automation techniques.
- Proficiency with scraping and browser automation tools such as Playwright, Scrapy, or Selenium.
- Strong knowledge of REST APIs, the HTTP protocol, and proxy management.
- Familiarity with SQL and NoSQL databases for data storage.
- Experience with Docker, Linux environments, and version control (Git).
- Fluent in English, both written and spoken.
- Self-driven and detail-oriented, with the ability to take ownership of projects.
Benefits
- High degree of freedom and impact in a growing scale-up business.
- Flexible remote work options.
- Competitive compensation and support for personal and professional development.
- Great work atmosphere within a small, talented, international team.
- Optional modern office located near Berlin.
