CoronaWhy Spanish Flu research framework
This project uses ML techniques to overlay CoronaWhy Spanish Flu data on COVID-19. All datasets are published in the CoronaWhy Data Lake.
Regular meetings
We share all meetings on YouTube. Feel free to join us if you would like to contribute.
- Kick-off #team-social-analysis
- Discussion about datasets and goals
# Datasets
Download the latest version of the KB Spanish flu dataset:

    wget http://datasets.coronawhy.org/api/access/datafile/503748 -O data.tar.gz; gzip -cd data.tar.gz | tar xf -
    wget http://datasets.coronawhy.org/api/access/datafile/741787 -O congress.tar.gz; gzip -cd congress.tar.gz | tar xf -

# Framework installation
Download Language Identification Model:

    wget https://dl.fbaipublicfiles.com/fasttext/supervised-models/lid.176.bin

Install fasttext module:

    pip install fasttext

# Usage
Run Language Detection process:

    python3 ./main.py

# Results
File citations.txt with relevant fragments will be produced based on keywords defined in config.py

# CoronaWhy infrastructure
You can also do full-text search in the whole collection by querying Elasticsearch index spanishflu:

    curl "http://search.coronawhy.org/spanishflu/_search?pretty=true&q=*"
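
The full-text search can also be called from Python. The following is a minimal sketch using only the standard library, not part of the framework itself. It assumes the spanishflu index accepts the standard Elasticsearch URI search parameters `q` and `size` and returns the usual `hits.hits[]._source` response layout. The helper names `build_search_url`, `extract_sources`, and `search` are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; the helpers below are illustrative.
BASE_URL = "http://search.coronawhy.org/spanishflu/_search"


def build_search_url(query="*", size=10, base_url=BASE_URL):
    """Build a URI-search URL; urlencode escapes the query string."""
    params = {"q": query, "size": size}
    return f"{base_url}?{urlencode(params)}"


def extract_sources(response):
    """Return the _source document of every hit in a search response.

    Assumes the standard Elasticsearch layout: response["hits"]["hits"].
    """
    hits = response.get("hits", {}).get("hits", [])
    return [hit.get("_source", {}) for hit in hits]


def search(query="*", size=10, timeout=30):
    """Run the query against the index and return the matching documents."""
    with urlopen(build_search_url(query, size), timeout=timeout) as resp:
        return extract_sources(json.load(resp))
```

Calling `search("influenza", size=5)` would request up to five documents matching the example term "influenza". The fields inside each returned document depend on how the index was built, so inspect one result before relying on specific keys.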
Built With
- jupyter-notebook
- python