Python-Java Scraping Automation Framework
Summary
Freelancer Client is hiring: Python-Java Scraping Automation Framework.
Location: Remote
I need a compact framework that combines robust data scraping with hands-off process automation. The work centres on pulling accurate product details and then running follow-up tasks (such as data cleaning, scheduling, or routing to downstream APIs) without manual intervention.
What you'll do:
• Scrapers that reliably collect product titles, prices, SKUs, images, and stock status, delivered as modular Python scripts or a Scrapy project.
Skills: Java, Python, Website Design, Data Processing, Mobile App Development, Web Scraping, Software Architecture, Data Scraping, Automation, API Integration
Budget: $15–$25 USD
Source: Freelancer Client via Remote / Online. Apply on the source website.
Original
Hey;
I need a compact framework that combines robust data scraping with hands-off process automation. The work centres on pulling accurate product details and then running follow-up tasks (such as data cleaning, scheduling, or routing to downstream APIs) without manual intervention.
Python will power the scraping layer—think requests, BeautifulSoup, Scrapy, or Selenium where dynamic content demands it. Java sits behind the scenes for orchestration and integration with existing services; Spring Boot is already in place, so wiring new endpoints or scheduled jobs into that stack should feel natural to you.
Key deliverables
• Scrapers that reliably collect product titles, prices, SKUs, images, and stock status, delivered as modular Python scripts or a Scrapy project.
• A Java-based automation module that picks up the scraped JSON/CSV output, persists it (PostgreSQL is preferred), and triggers any post-processing you advise.
• Clear setup instructions plus a short README so future tweaks are straightforward.
Acceptance criteria
1. Scrapers run from the command line and finish without unhandled errors.
2. Output matches the product details schema we agree on, with at least 95 % field completeness across a test batch.
3. Java scheduler processes the files automatically and logs success/failure events for every run.
If you have examples of earlier Python-to-Java hand-offs or e-commerce data pipelines, that will help us move very quickly.
Location & Details
Apply on source →About this listing
This remote opportunity was imported from Freelancer and is shown here for discovery. To apply, follow the link to the original posting.