GitHub

How we built it: What we did is pull data from the different monarch butterfly and AQI sites given to us, and we extracted data like daily AQI, temperature, monarch butterfly count and more, then compared these values with one another. We then generated a custom LLM model via OpenAI that can process and recreate CSV files with relevant season, county, AQI, temperature, and FIPS columns given only a date, state, and city. We first visualized the monarch count, creating a script to normalize our data, then generate a heat map of the U.S. to visualize the dispersion of Monarch sightings by county over the last 15 years using Python libraries like matplotlib and geopandas. Next we mapped this information across relevant data such as Milkweed growth rates, temperature, and AQI by county to scan for similarities in migration patterns. We then started a deeper statistical analysis of relevant variables, beginning with a linear regression model, before pivoting to a negative binomial regression model due to our Y value being a count variable, and our data-set being heavily over-dispersed. This allowed us to adequately find a correlation between season, year, air quality, and temperature across 15 years to map and explain the decline of monarch butterflies, as well as generate a potential solution.

Challenges we ran into: Our original AI analysis was consistently finding no correlation, contrary to our pre-processing research. After analyzing our model choice, we realized we weren't using the best approach, and had to pivot last-minute. Our data pre-processing initiative was very long due to maintaining a grueling verification process of our data transformations, however this allowed us to verify datasets as we worked, rather than encountering errors and having to fix them later in our process.

Accomplishments that we're proud of: Created CSVs through batch processing so we could test things at a small level, and scale it up to find multiple correlation factors. Created several CSVs using Python, allowing us to visualize our data and more. Utilizing multiple AI models and visualization tactics.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
115795-V3		115795-V3
csv-files-data		csv-files-data
extra-csvs		extra-csvs
generated_maps		generated_maps
monarch_sightings_csv		monarch_sightings_csv
monarch_sightings_maps		monarch_sightings_maps
test-graphs		test-graphs
text-files-data		text-files-data
2spatial.py		2spatial.py
3test.py		3test.py
4test.py		4test.py
README.md		README.md
analysis.py		analysis.py
binomial.py		binomial.py
combine.py		combine.py
elv-transfer.py		elv-transfer.py
filter.py		filter.py
migrate_dataset.py		migrate_dataset.py
normalize-data.py		normalize-data.py
normalized_monarch_sightings.csv		normalized_monarch_sightings.csv
outlier-check.py		outlier-check.py
relad.py		relad.py
scale.py		scale.py
spatial.py		spatial.py
spatial_head.py		spatial_head.py
temp-transfer.py		temp-transfer.py
temp.py		temp.py
test.py		test.py
transfer.py		transfer.py
transformed_monarch_sightings.csv		transformed_monarch_sightings.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

TeaganSmith/Datathon2024

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages