Data Enrichment & AI-Powered Product Intelligence

Project scope
Categories
Data analysis Artificial intelligence Data scienceSkills
business problems beautifulsoup web scraping application programming interface (api) analytical thinking information gathering data science python (programming language) data wranglingProject Scope
The CANADA List empowers Canadians to support local businesses by offering an independent database of thousands of products, each rated for its true impact on the Canadian economy. To take our platform to the next level, we’re looking for student collaborators to harness the power of AI and automated data collection—enriching our product listings with richer, more useful information for consumers.
The goal: Collect, verify, and structure key data points across the products in our database, to improve accuracy, usability, and depth of the resource.
Student Learning Opportunities
Students will:
- Apply and develop skills in data science, AI/ML, and web scraping
- Gain practical experience working with real-world, large-scale product datasets
- Solve real business problems and see the direct impact of their work
- Learn how to manage and structure business data for maximum value
- Practice communicating technical findings to a non-technical client
Support & Mentorship
Students will work directly with TCL’s founder and team for:
- Technical and strategic mentorship
- Weekly or biweekly check-ins
- Guidance on project requirements and priorities
- Feedback and support on troubleshooting and overcoming data challenges
Ideal Student Skills
- Python or relevant scripting language
- Familiarity with web scraping frameworks (e.g., BeautifulSoup, Scrapy, Selenium) and/or AI APIs
- Data wrangling and JSON/database organization
- Analytical thinking and attention to detail
- Strong documentation and communication skills
Why Join This Project?
This project offers a chance to build your portfolio with a meaningful, high-impact data science project for a recognized Canadian platform. You’ll tackle authentic challenges in data automation, AI, and business analytics—directly improving a resource used by thousands of Canadians.
Specific roles will include:
Automated Data Collection & Enrichment
- Use scraping tools and/or AI APIs to gather information regarding: The provincial availability of each product, the size of the underlying business (to flag small businesses), and product images
Data Verification & Organization
- Cross-validate scraped or AI-suggested data for reliability
- Structure all collected data in a standardized format matching The CANADA List’s JSON/database schema
Project outcomes will include:
- The collected information organized to match The CANADA List's JSON/database schema
Sharing knowledge in specific technical skills, techniques, methodologies required for the project.
Direct involvement in project tasks, offering guidance, and demonstrating techniques.
Providing access to necessary tools, software, and resources required for project completion.
Scheduled check-ins to discuss progress, address challenges, and provide feedback.
Supported causes
The global challenges this project addresses, aligning with the United Nations Sustainable Development Goals (SDGs). Learn more about all 17 SDGs here.
About the company
The CANADA List is an independent platform that helps Canadians make informed, patriotic purchasing decisions by rating and reviewing thousands of products and brands based on Canadian ownership, manufacturing, sourcing, and job impact. Our mission is to empower consumers to support local businesses and strengthen the Canadian economy, while bringing greater transparency to the products on Canadian shelves.