This web application was built as a research tool for the Digital Media Alliance of Florida, designed to collect and classify data on digital media companies across the state. The project involved developing a web scraper to extract company information from public records, structuring the data in a database organized by county, and importing NAICS codes for classification.
To enhance accuracy, I implemented a variant of Latent Semantic Indexing (LSI) to analyze company names, descriptions, and other metadata, allowing the system to automatically determine the business category of each company. The platform features an intuitive web interface that enables researchers to review, verify, and correct classifications as needed. Additionally, I integrated Mapbox to provide a visual representation of digital media companies across Florida.
This project showcases my ability to build end-to-end data-driven applications, combining web scraping, data processing, machine learning-based classification, and interactive visualization to support research and decision-making.