1a. NRC Word–Emotion Association Lexicon (EmoLex)
Homepage and interactive visualization available on the lexicon page. Also available in 40+ languages; sense-level annotations for eight emotions are provided.
This page lists word association lexicons capturing sentiment, emotion, and colour associations. These resources can be used to analyze emotions in text. Please review the Emotion Lexicons: Ethics & Data Statement and see Terms of Use below before using a lexicon.
Contact: Saif M. Mohammad (saif.mohammad@nrc-cnrc.gc.ca)
Created via expert or crowdsourced annotation. Real-valued scores use Best–Worst Scaling for reliable, fine-grained values.
Homepage and interactive visualization available on the lexicon page. Also available in 40+ languages; sense-level annotations for eight emotions are provided.
English words with intensities for eight basic emotions using Best–Worst Scaling. Lexicon homepage.
English words with valence (positive–negative), arousal (excited–calm), and dominance/competence (powerful–weak, competent–incompetent) scores.
44k+ English words with association scores along the calmness–anxiety dimension. Lexicon homepage.
~31k English terms (≈26k unigrams; ≈5k MWEs) with real-valued associations for warmth (W), competence (C), sociability (S), and trust (T). Lexicon homepage.
Scores for two- and three-word expressions and their constituents.
Extracted from large corpora using co‑occurrence/statistical signals. These have higher coverage (domain‑specific terms) but may be less precise than manual lexicons.
Built from tweets tagged with emotion‑word hashtags. Hashtag Emotion Corpus (TEC) available.
| Lexicon | Version | Coverage | Scores | Creation & Domain |
|---|---|---|---|---|
| NRC Hashtag Sentiment Lexicon | 1.0 (2013) | 54,129 unigrams; 316,531 bigrams; 308,808 pairs | −∞ to ∞ | Automatic from sentiment‑hashtag tweets · Domain: Twitter |
| NRC Hashtag Affirmative/Negated Context Lexicons | 1.0 (2014) | Affirmative: 36,357 unigrams, 159,479 bigrams · Negated: 7,592 unigrams, 23,875 bigrams | −∞ to ∞ | Automatic from tweets; separate entries for context |
| Emoticon Lexicon (Sentiment140) | 1.0 (2014) | 62,468 unigrams; 677,698 bigrams; 480,010 pairs | −∞ to ∞ | Automatic from emoticon‑bearing tweets · Domain: Twitter |
| Sentiment140 Affirmative/Negated Context | 1.0 (2014) | Affirmative: 45,255 unigrams; 240,076 bigrams · Negated: 9,891 unigrams; 34,093 bigrams | −∞ to ∞ | Automatic from tweets; separate entries for context |
a) Yelp Restaurant Sentiment Lexicon
(built from the Yelp Dataset for selected restaurant categories).
b) Amazon Laptop Sentiment Lexicon
| Resource | Version | Coverage | Scores | Creation & Domain |
|---|---|---|---|---|
| Yelp Restaurant | 1.0 (2014) | 39,274 unigram entries (incl. affirmative/negated); 276,651 bigram entries | −∞ to ∞ | Automatic from Yelp reviews · Domain: Restaurants |
| Amazon Laptop | 1.0 (2014) | 26,577 unigram entries (incl. affirmative/negated); 155,167 bigram entries | −∞ to ∞ | Automatic from Amazon reviews · Domain: Laptops |