Background: Companies House data only includes registered office addresses. We require the actual trading addresses (principal place(s) of business) for analysis, marketing outreach, or compliance.
Objective: Build a pipeline that takes a list of UK company numbers (and optional SIC codes) and outputs a CSV with:
Company number
Company name
Number of employees
Turnover (where available)
SIC code(s)
Trading address (street, city, postcode)
2. Scope of Work
Core Data Ingestion
Download/ingest the monthly Companies House bulk CSV (or use the Companies House API) to get company number, name, postcode, SIC code(s).
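As a rough illustration of the ingestion step, the sketch below streams a bulk CSV and keeps only the four fields listed above. The column names (`CompanyName`, `CompanyNumber`, `RegAddress.PostCode`, `SICCode.SicText_1` to `SicText_4`) are assumptions about the bulk file's header and should be checked against the actual download; `read_companies` and the sample row are hypothetical.

```python
import csv
import io
from typing import Iterable, Iterator, Optional, Set

# Assumed header names; verify against the real bulk file before relying on them.
SIC_COLUMNS = [f"SICCode.SicText_{i}" for i in range(1, 5)]


def read_companies(lines: Iterable[str],
                   wanted: Optional[Set[str]] = None) -> Iterator[dict]:
    """Yield one slim record per company, optionally filtered by company number."""
    for raw in csv.DictReader(lines):
        # Strip stray whitespace from header names and values.
        row = {k.strip(): (v or "").strip()
               for k, v in raw.items() if k is not None}
        number = row.get("CompanyNumber", "")
        if wanted is not None and number not in wanted:
            continue
        yield {
            # Kept as a string so leading zeros survive.
            "company_number": number,
            "company_name": row.get("CompanyName", ""),
            "postcode": row.get("RegAddress.PostCode", ""),
            "sic_codes": [row[c] for c in SIC_COLUMNS if row.get(c)],
        }


# Made-up sample row, for illustration only.
SAMPLE = (
    "CompanyName, CompanyNumber,RegAddress.PostCode,"
    "SICCode.SicText_1,SICCode.SicText_2\n"
    '"ACME WIDGETS LTD",01234567,AB1 2CD,'
    "62012 - Business and domestic software development,\n"
)
companies = list(read_companies(io.StringIO(SAMPLE), wanted={"01234567"}))
```

Passing an open file handle instead of `io.StringIO` would process the full bulk file row by row without loading it into memory.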
Trading-Address Enrichment
Primary method: Parse iXBRL filings for .
Fallback method: Query a Places API (e.g. Google Places or Foursquare) by “company name + postcode” to retrieve a formatted address.
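The primary/fallback order described above could be wired together roughly as follows. This is only a sketch: `ixbrl_lookup` and `places_lookup` are hypothetical callables standing in for the real iXBRL parser and Places client, and the `source` labels are illustrative.

```python
from typing import Callable, Optional

# Hypothetical lookup signature: takes a key, returns an address string or None.
Lookup = Callable[[str], Optional[str]]


def build_places_query(name: str, postcode: str) -> str:
    """Build the 'company name + postcode' search string for the fallback."""
    parts = (name or "", postcode or "")
    return " ".join(p.strip() for p in parts if p.strip())


def resolve_trading_address(company: dict,
                            ixbrl_lookup: Lookup,
                            places_lookup: Lookup) -> dict:
    """Try the iXBRL filing first, then fall back to a Places-style search."""
    address = ixbrl_lookup(company["company_number"])
    if address:
        return {"trading_address": address, "source": "ixbrl"}

    query = build_places_query(company["company_name"], company["postcode"])
    address = places_lookup(query) if query else None
    if address:
        return {"trading_address": address, "source": "places_api"}

    # Neither method produced an address; leave it for manual review.
    return {"trading_address": None, "source": "unresolved"}
```

Recording which method produced each address makes it easier to audit fallback results separately from filing-derived ones.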
Data Merging & Cleanup
Consolidate registered vs. trading address fields.
Standardize address formatting.
Deduplicate and log failures for manual review.
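A minimal sketch of the cleanup step, under some assumptions: records are dicts with hypothetical `street`, `city` and `postcode` keys, duplicates are identified by company number, and a record counts as a failure when its street is missing or its postcode does not look plausible. The postcode check is deliberately loose (length and characters only), not a full validation.

```python
import logging
import re
from typing import Iterable, List, Optional, Tuple

log = logging.getLogger("address_cleanup")


def normalise_postcode(raw: Optional[str]) -> Optional[str]:
    """Upper-case the postcode and put one space before the final three characters."""
    compact = re.sub(r"\s+", "", raw or "").upper()
    # UK postcodes are 5 to 7 alphanumeric characters once spaces are removed.
    if not (5 <= len(compact) <= 7 and compact.isalnum()):
        return None
    return f"{compact[:-3]} {compact[-3:]}"


def tidy(text: Optional[str]) -> str:
    """Collapse runs of whitespace and trim the ends."""
    return " ".join((text or "").split())


def clean_records(records: Iterable[dict]) -> Tuple[List[dict], List[dict]]:
    """Return (cleaned, failures), keeping the first record per company number."""
    seen, cleaned, failures = set(), [], []
    for rec in records:
        number = rec["company_number"]
        if number in seen:
            continue  # duplicate company number
        seen.add(number)
        postcode = normalise_postcode(rec.get("postcode"))
        street = tidy(rec.get("street"))
        if postcode is None or not street:
            log.warning("Needs manual review: %s", number)
            failures.append(rec)
            continue
        cleaned.append({**rec, "street": street,
                        "city": tidy(rec.get("city")), "postcode": postcode})
    return cleaned, failures
```

For example, `normalise_postcode("sw1a1aa")` returns `"SW1A 1AA"`, while an empty or malformed value returns `None` and routes the record to the failures list.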
Export & Delivery
Export a final CSV with the key fields.
Provide a short one-page README describing usage and dependencies.
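For the export step, one possible shape for the final CSV is sketched below. The column names are hypothetical labels derived from the output fields listed in the objective, and joining multiple SIC codes with a semicolon is an assumption, not a stated requirement.

```python
import csv
import io
from typing import Iterable, TextIO

# Hypothetical column names based on the output fields in the brief.
FIELDS = [
    "company_number", "company_name", "employees", "turnover",
    "sic_codes", "trading_street", "trading_city", "trading_postcode",
]


def export_csv(records: Iterable[dict], out: TextIO) -> int:
    """Write records to `out` as CSV and return the number of rows written."""
    writer = csv.DictWriter(out, fieldnames=FIELDS,
                            restval="",             # blank when a value is unavailable
                            extrasaction="ignore")  # drop internal helper keys
    writer.writeheader()
    count = 0
    for rec in records:
        row = dict(rec)
        # Flatten the SIC code list into a single cell.
        row["sic_codes"] = ";".join(rec.get("sic_codes", []))
        writer.writerow(row)
        count += 1
    return count


# Made-up record, for illustration only; turnover is left out on purpose.
buffer = io.StringIO()
written = export_csv([{
    "company_number": "01234567",
    "company_name": "ACME WIDGETS LTD",
    "employees": 12,
    "sic_codes": ["62012", "62020"],
    "trading_street": "1 High St",
    "trading_city": "Leeds",
    "trading_postcode": "LS1 1AA",
}], buffer)
```

When writing to a real file, opening it with `newline=""` avoids doubled line endings on Windows, as the `csv` module documentation recommends.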
4. Required Skills & Experience
Strong Python (or Node.js) coding for data pipelines.
Experience parsing XBRL/iXBRL (e.g. python-iXBRL or equivalent).
Familiarity with REST API consumption (Companies House, Google/Foursquare, OpenCorporates).
Familiarity with web-scraping frameworks (Scrapy, BeautifulSoup, Puppeteer) is a plus.
Data cleansing and address standardization best practices.
Docker and CLI scripting for packaging (optional but preferred).
Milestones:
Core data ingestion + sample of 50 records
iXBRL enrichment + fallback API integration
Data cleanup, export & documentation
Please include in your proposal:
Relevant past projects / GitHub samples (especially XBRL or address-enrichment work).
Confirmation you can deliver the three key deliverables.