Hoo Hacks

MySQL Python Flask

1) What is ScholarED?

ScholarED is an AI based tool that leverages Machine Learning, NLP and cloud computing to generate research paper recommendations from over 500,000+ research papers in our dataset (subset from the Arxiv dataset) based on the input provided by the user (input can be provided in various image,video,audio and document formats).

2) Inspiration ?

Finding similar papers to a publication is one of the most crucial and challenging aspect of the research as researches need to manually inspect similar papers due to the inability of softwares like google scholar and Microsoft academic to accept non textual input making the process extremely inefficient.Adding to this, lack of access to university resources and libraries due to COVID-19 have amplified the existing obstacles faced during research. These challenged paved way for our project.

3) Implementation:

ml_hyl

ml

pdashboard

4) Features :

• Allows input file in 3 formats i.e PDF, Video, Audio

• Allows the input file to be in any language which can further be translated in english, french, spanish.

• Generates a textual summary of the input file provided by the user regardless of the format of submission(video/audio/image/PDF).

• Generates top 3 video recommendations based on the input

• Generates top 3 recommendation of similar research papers.

• Has a personalized dashboard for each member for them to come back and view their past work

image

5) Types of inputs accepted :

• Video: MP4 • Audio: MP3 • Document: PDF • Image: JPG/JPEG/PNG

6) Some Use Cases:

• Further explore concepts discussed in class lectures recordings.

• Find research papers similar to a perviously read publications.

• get papers that can be used for citations of your essays.

7) Technologies Used :

• HTML5 •CSS3 •SCSS •Python • MySQL• Flask •JS •JupyterNotebook •PowerShell

8) API's used :

• Google Cloud Vision API

• IBM Watson Speech-to-Text API

• Youtube API v3

• Google Translate API

• BART-Large CNN model

• MySQL

9) Machine Learning Models used :

• BART- Large CNN model

•Specter (Generates Embeddings)

Share this project:

Updates