Live App: https://hackalytics-eq2iqkxqwylq3pzbp9vf7b.streamlit.app
Hackalytics is an interactive data dashboard that analyzes hackathon trends from 2009 → 2025 using real project data. It helps answer a simple but powerful question:
What are hackers building, using, and focusing on over time?
Hackalytics breaks down hackathon trends into three core insights:
We analyze project descriptions to surface recurring themes and keywords.
- Word cloud of common ideas
- Theme trends over time
We track the technologies used in projects across years.
- Bar chart of most-used tools
- Distribution breakdown of tech stacks
We visualize global hackathon distribution using location data.
- Interactive map
- Top locations by year
- Cleaned and processed hackathon + project datasets
- Extracted:
- Themes
- Tools ("built with")
- Locations
- Aggregated trends by year
- Built an interactive dashboard using Streamlit
To improve performance, we used caching so data only loads once instead of reprocessing on every interaction.
- Python
- Pandas
- Streamlit
- PyDeck
- Matplotlib
- WordCloud
- Clone the repository:
git clone https://github.com/sneha-byte/Hackalytics.git
cd Hackalytics- Set up python virtual environment and install dependencies
- Scrape data
cd scraping
python3 scrape_hackathons.py
python3 run_chunks 0 9
scrapy crawl HackathonLocationSpider -O ../data/locations.csv -a dataset="../data/hackathons.csv"- Process data.
- Set up .env file for geocoding
GOOGLE_MAPS_API_KEY=<your key>
- Run process.ipynb
- Run analysis.ipynb
- Deploy streamlit dashboard
cd app
streamlit run Home.py


