Distances derived from word embeddings can measure a range of gradational relations—similarity, hierarchy, entailment, and stereotype—and can be used at the document- and author-level in ways that overcome some of the limitations of weighted dictionary methods. We provide a comprehensive introduction to using word embeddings for relation induction, and demonstrate how such techniques can complement dictionary methods as unsupervised, deductive methods.
AcevesPedroEvansJames A.. 2023. “Mobilizing Conceptual Spaces: How Word Embedding Models Can Inform Measurement and Theory Within Organization Science.” Organization Science: 1-27.
2.
AkramAl-Turk. 2020. The Rise of Performance-Based Accountability in Education in the United States: 1965-1994. PhD thesis, University of North Carolina at Chapel Hill.
3.
AntoniakMariaMimnoDavid. 2018. “Evaluating the Stability of Embedding-Based Word Similarities.” Transactions of the Association for Computational Linguistics6:107-19.
4.
AroraSanjeevLiYuanzhiLiangYingyuMaTengyuRisteskiAndrej. 2016a. “A Latent Variable Model Approach to PMI-Based Word Embeddings.” Transactions of the Association for Computational Linguistics4:385-99.
5.
AroraSanjeevLiangYingyuMaTengyu. 2016b. “A Simple but Tough-to-Beat Baseline for Sentence Embeddings.” 5th International Conference on Learning Representations.
6.
Arseniev-KoehlerAlina. 2021. “Theoretical Foundations and Limits of Word Embeddings: What Types of Meaning Can They Capture?” arXiv 2107.10413.
7.
Arseniev-KoehlerAlinaCochranSusan D.MaysVickie M.ChangKai-WeiFosterJacob Gates. 2021. “Integrating Topic Modeling and Word Embedding to Characterize Violent Deaths.” arXiv 2106.14365.
8.
Arseniev-KoehlerAlinaFosterJacob G.. 2022. “Machine Learning As a Model for Cultural Learning: Teaching An Algorithm What it Means to be Fat.” Sociological Methods & Research51:1484-539.
9.
ArtetxeMikelLabakaGorkaLopez-GazpioIñigoAgirreEneko. 2018. “Uncovering Divergent Linguistic Information in Word Embeddings With Lessons for Intrinsic and Extrinsic Evaluation.” arXiv 1809.02094.
10.
AslanidisParis. 2018. “Measuring Populist Discourse With Semantic Text Analysis: An Application on Grassroots Populist Mobilization.” Quality & Quantity52:1241-63.
11.
AtasuKubilayParnellThomasDünnerCelestineSifalakisManolisPozidisHaralamposVasileiadisVasileiosVlachosMichailBerrospiCesarLabbiAbdel. 2017. “Linear-Complexity Relaxed Word Mover’s Distance With GPU Acceleration.” Pp. 889-96 in 2017 IEEE International Conference on Big Data, IEEE.
12.
BaayenHarald R.. 2002. Word Frequency Distributions. Berlin, Germany. Springer.
13.
BatzdorferVeronikaSteinmetzHolgerBiellaMarcoAlizadehMeysam. 2021. “Conspiracy Theories on Twitter: Emerging Motifs and Temporal Dynamics During the COVID-19 Pandemic.” International Journal of Data Science and Analytics13:315-333.
14.
BerryGeorgeTaylorSean J.. 2017. “Discussion Quality Diffuses in the Digital Public Square.” Pp. 1371-380 in Proceedings of the 26th International Conference on World Wide Web, International World Wide Web Conferences Steering Committee.
BhattAnjali M.GoldbergAmirSrivastavaSameer B.. 2021. “A Language-Based Method for Assessing Symbolic Boundary Maintenance Between Social Groups.” Sociological Methods & Research51(4):1681-1720.
17.
BojanowskiPiotrGraveEdouardJoulinArmandMikolovTomas. 2017. “Enriching Word Vectors With Subword Information.” Transactions of the Association for Computational Linguistics5:135-46.
18.
BolukbasiTolgaChangKai-WeiZouJamesSaligramaVenkateshKalaiAdam. 2016a. “Quantifying and Reducing Stereotypes in Word Embeddings.” arXiv 1606.06121.
19.
BolukbasiTolgaChangKai-WeiZouJames Y.SaligramaVenkateshKalaiAdam T.. 2016b. “Man is to Computer Programmer as Woman Is to Homemaker? Debiasing Word Embeddings.” Pp. 4349-357 in Advances in Neural Information Processing Systems 29.
20.
BouraouiZiedJameelShoaibSchockaertSteven. 2018. “Relation Induction in Word Embeddings Revisited.” Pp. 1627-637 in Proceedings of the 27th International Conference on Computational Linguistics.
21.
BoutylineAndreiCornellDevinArseniev-KoehlerAlina. 2021. “All Roads Lead to Polenta.” Sociological Forum36(S1):1419-1445.
BrokosGeorgios-IoannisMalakasiotisProdromosAndroutsopoulosIon. 2016. “Using Centroids of Word Embeddings and Word Mover’s Distance for Biomedical Document Retrieval in Question Answering.” arXiv 1608.03905.
24.
BrunilaMikaelLaVioletteJack. 2021. “WMDecompose: A Framework for Leveraging the Interpretable Properties of Word Movers Distance in Sociocultural Analysis.” arXiv 2110.07330.
25.
BrysbaertMarcWarrinerAmy BethKupermanVictor. 2014. “Concreteness Ratings for 40 Thousand Generally Known English Word Lemmas.” Behavior Research Methods46:904-11.
26.
CaliskanAylinBrysonJoanna J.NarayananArvind. 2017. “Semantics Derived Automatically From Language Corpora Contain Human-Like Biases.” Science (New York, NY)356:183-6.
27.
CaliskanAylinLewisMolly. 2020. “Social Biases in Word Embeddings and Their Relation to Human Cognition.” PsyArXiv d84kg.
28.
CarboneLucaMijsJonathan. 2022. “Sounds Like Meritocracy to My Ears: Exploring the Link Between Inequality in Popular Music and Personal Culture.” Information, Communication and Society25(5):707-725. https://doi.org/10.1080/1369118X.2021.2020870
29.
Charu C.AggarwalHinneburgAlexanderKeimDaniel A.. 2001. “On the Surprising Behavior of Distance Metrics in High Dimensional Space.” Pp. 420-34 in Database Theory. Springer.
30.
ChengMengjieSmithDaniel ScottRenXiangCaoHanchengSmithSanneMcFarlandDaniel A.. 2023. “How New Ideas Diffuse in Science.” American Sociological Review.00031224231166955.88(3):522-561.
31.
ChersoniEmmanueleXiangRongLuQinHuangChu-Ren. 2020. “Automatic Learning of Modality Exclusivity Norms With Crosslingual Word Embeddings.” Pp. 32-38 in Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics.
32.
ChiangHsiao-YuCamacho-ColladosJosePardosZachary. 2020. “Understanding the Source of Semantic Regularities in Word Embeddings.” Pp. 119-31 in Proceedings of the 24th Conference on Computational Natural Language Learning.
33.
CouilletRomainCinarYagmur GizemGaussierEricImranMuhammad. 2020. “Word Representations Concentrate and This Is Good News!” Pp. 325-34 in Proceedings of the 24th Conference on Computational Natural Language Learning.
34.
DamienFrancoisWertzVincentVerleysenMichel. 2007. “The Concentration of Fractional Distances.” IEEE Transactions on Knowledge and Data Engineering19:873-86.
35.
DeterdingNicole M.WatersMary C.. 2018. “Flexible Coding of In-Depth Interviews: A Twenty-First-Century Approach.” Sociological Methods & Research. 50(2):708-739.
36.
DingTaoRoyArpitaChenZhiyuanZhuQianPanShimei. 2016. “Analyzing and Retrieving Illicit Drug-Related Posts from Social Media.” Pp. 1555-560 in 2016 IEEE International Conference on Bioinformatics and Biomedicine.
37.
DingwallNicholasPottsChristopher. 2018. “Mittens: An Extension of GloVe for Learning Domain-Specialized Representations.”.
38.
DodgeJesseSapMaartenMarasovićAnaAgnewWilliamIlharcoGabrielGroeneveldDirkMitchellMargaretGardnerMatt. 2021. “Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus.”.
39.
DuYuhaoJosephKenneth. 2020. “MDR Cluster-Debias: A Nonlinear Word Embedding Debiasing Pipeline.” Pp. 45-54 in Social, Cultural, and Behavioral Modeling.
40.
DurrheimKevinSchuldMariaMafundaMartinMazibukoSindisiwe. 2022. “Using Word Embeddings to Investigate Cultural Biases.” The British Journal of Social Psychology62(1):617-629.
41.
EarlJenniferSouleSarah A.McCarthyJohn D.. 2003. “Protest Under Fire? Explaining the Policing of Protest.” American Sociological Review68:581-606.
42.
EnggaardThygeLohseAugustPedersenMorten AxelLehmannSune. 2023. “Dialectograms: Machine Learning Differences Between Discursive Communities.”.
43.
FellbaumChristiane. 1998. WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.
44.
FloresRené D.. 2017. “Do Anti-immigrant Laws Shape Public Sentiment? A Study of Arizona’s SB 1070 Using Twitter Data.” American Journal of Sociology123:333-84.
45.
FryeMargaretGheihmanNina. 2018. “Like Bees to a Flower: Attractiveness, Risk, and Collective Sexual Life in An AIDS Epidemic.” Sociological Science5:596-627.
46.
FuldaNancyRicksDanielMurdochBenWingateDavid. 2017. “What Can You Do With a Rock? Affordance Extraction via Word Embeddings.” arXiv prvolume arXiv:1703.03429.
47.
GargNikhilSchiebingerLondaJurafskyDanZouJames. 2018. “Word Embeddings Quantify 100 Years of Gender and Ethnic Stereotypes.” Proceedings of the National Academy of Sciences of the United States of America115:E3635-E3644.
48.
GartenJustinHooverJoeJohnsonKate M.BoghratiReihaneIskiwitchCarolDehghaniMorteza. 2018. “Dictionaries and Distributions: Combining Expert Knowledge and Large Scale Textual Data Content Analysis.” Behavior Research Methods50:344-61.
49.
GennaroGloriaAshElliott. 2021. “Emotion and Reason in Political Language.” The Economic Journal132(643):1037-1059.
50.
GentzkowMatthewShapiroJesse M.TaddyMatt. 2018. “Congressional Record for the 43rd–114th Congresses: Parsed Speeches and Phrase Counts.”https://data.stanford.edu/congresstext.
51.
GentzkowMatthewShapiroJesse M.TaddyMatt. 2019. “Measuring Group Differences in High-dimensional Choices: Method and Application to Congressional Speech.” Econometrica: Journal of the Econometric Society87:1307-40.
52.
GoldbergAmirSrivastavaSameer B.Govind ManianV.MonroeWilliamPottsChristopher. 2016. “Fitting in Or Standing out? The Tradeoffs of Structural and Cultural Embeddedness.” American Sociological Review81:1190-222.
53.
GonenHilaGoldbergYoav. 2019. “Lipstick on a Pig: Debiasing Methods Cover Up Systematic Gender Biases in Word Embeddings But Do Not Remove Them.” arXiv 1903.03862.
54.
GrandGabrielBlankIdan AsherPereiraFranciscoFedorenkoEvelina. 2018. “Semantic Projection: Recovering Human Knowledge of Multiple, Distinct Object Features From Word Embeddings.” arXiv prvolume arXiv:1802.01241.
55.
GülleKim JulianFordNicholasEbelPatrickBrokhausenFlorianVogelsangAndreas. 2020. “Topic Modeling on User Stories Using Word Mover’s Distance.” Pp. 52-60 in 2020 IEEE Seventh International Workshop on AIRE.
56.
HaberJaren. 2021. “Sorting Schools: A Computational Analysis of Charter School Identities and Stratification.” Sociology of Education94(1):43-64.
57.
HaberJarenHavemanHeatherHongYoon Sung. 2021. “Toward Computational Literature Reviews: Applying Expert-Built Dictionaries for Automated Analysis of Complex Texts.”.
58.
HamiltonWilliam L.LeskovecJureJurafskyDan. 2016. “Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change.” Pp. 1489-501 in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics.
HosseinAzarpanahFarhadlooMohsen. 2021. “Measuring Biases of Word Embeddings: What Similarity Measures and Descriptive Statistics to Use?” Pp. 8-14 In Proceedings of the First Workshop on Trustworthy Natural Language Processing, Association for Computational Linguistics.
61.
HuMinqingLiuBing. 2004. “Mining and Summarizing Customer Reviews.” Pp. 168-77 in Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
62.
IacobacciIgnacioPilehvarMohammad TaherNavigliRoberto. 2016. “Embeddings for Word Sense Disambiguation: An Evaluation Study.” pp. 897-907 in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics.
63.
JiaqiMuBhatSumaViswanathPramod. 2017. “All-But-the-Top: Simple and Effective Postprocessing for Word Representations.”.
64.
JockersMatthew L.. 2015. Syuzhet: Extract Sentiment and Plot Arcs from Text.
65.
JonesJason J.AminMohammad RuhulKimJessicaSkienaSteven. 2020. “Stereotypical Gender Associations in Language Have Decreased Over Time.” Sociological Science7:1-35.
66.
JosephKennethMorganJonathan H.. 2020. “When Do Word Embeddings Accurately Reflect Surveys on Our Beliefs About People?” arXiv 2004.12043.
67.
KafeEric. 2019. “Fitting Semantic Relations to Word Embeddings.” p. 228 in Wordnet Conference.
68.
KantorovichL. V.. 1960. “Mathematical Methods of Organizing and Planning Production.” Management Science6:366-422.
69.
KaripbayevaAidanaSorokinaAlenaAssylbekovZhenisbek. 2019. “A Critique of the Smooth Inverse Frequency Sentence Embeddings.” arXiv 1909.13494.
70.
KhodakMikhailSaunshiNikunjLiangYingyuMaTengyuStewartBrandonAroraSanjeev. 2018. “A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors.”.
71.
KoreniusTuomoLaurikkalaJormaJuholaMartti. 2007. “On Principal Component Analysis, Cosine and Euclidean Measures in Information Retrieval.” Information Sciences177:4893-905.
KovácsBalázsCarrollGlenn R.LehmanDavid W.. 2017. “The Perils of Proclaiming An Authentic Organizational Identity.” Sociological Science4:80-106.
74.
KozlowskiAustin C.TaddyMattEvansJames A.. 2019. “The Geometry of Culture: Analyzing the Meanings of Class Through Word Embeddings.” American Sociological Review84:905-49.
75.
KusnerMattSunYuKolkinNicholasWeinbergerKilian. 2015. “From Word Embeddings to Document Distances.” Pp. 957-66 in International Conference on Machine Learning.
76.
LandauerThomas K.DumaisSusan T.. 1997. “A Solution to Platos Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge.” Psychological Review104:211.
77.
LarsenAnders Boesen LindboSønderbySøren KaaeLarochelleHugoWintherOle. 2016. “Autoencoding Beyond Pixels Using a Learned Similarity Metric.” Volume 48, Pp. 1558-566 in Proceedings of the 33rd International Conference on Machine Learning.
78.
LawsonCarol. 1995. “After a Protest by Parents, Crayola Changes its Recipes.” The New York Times. Nov. 15, 1995. Section C:11. https://www.nytimes.com/1995/11/15/garden/after-a-protest-by-parents-crayola-changes-its-recipes.html
79.
LawsonM. AsherMartinAshley E.HudaImrulMatzSandra C.. 2022. “Hiring Women Into Senior Leadership Positions is Associated With a Reduction in Gender Stereotypes in Organizational Language.” PNAS119.
80.
LazaridouAngelikiMarelliMarcoBaroniMarco. 2017. “Multimodal Word Meaning Induction From Minimal Exposure to Natural Text.” Cognitive Science41 Suppl 4:677-705.
81.
LeQuocMikolovTomas. 2014. “Distributed Representations of Sentences and Documents.” Pp. 1188-196 in International Conference on Machine Learning.
82.
LeschkeJulia C.SchwemmerCarsten. 2019. “Media Bias Towards African-Americans Before and After the Charlottesville Rally.” P. 10 in Weizenbaum Conference.
83.
LiChangchunOuyangJihongLiXiming. 2019. “Classifying Extremely Short Texts by Exploiting Semantic Centroids in Word Mover’s Distance Space.” Pp. 939-49 in The World Wide Web Conference.
84.
LiuBingHuMinqingChengJunsheng. 2005. “Opinion Observer: Analyzing and Comparing Opinions on the Web.” Pp. 342-51 in Proceedings of the 14th International Conference on WWW.
85.
LixKatharinaGoldbergAmirSrivastavaSameerValentineMelissa A.. 2020. “Aligning Differences: Discursive Diversity and Team Performance.”
86.
LynottDermotConnellLouiseBrysbaertMarcBrandJamesCarneyJames. 2020. “The Lancaster Sensorimotor Norms.” Behavior Research Methods52:1271-91.
87.
ManziniThomasLimYao ChongTsvetkovYuliaBlackAlan W.. 2019. “Black Is to Criminal as Caucasian Is to Police: Detecting and Removing Multiclass Bias in Word Embeddings.” arXiv 1904.04047.
88.
Martin-CaugheyAnanda. 2021. “What’s in An Occupation? Investigating Within-Occupation Variation and Gender Segregation Using Job Titles and Task Descriptions.” American Sociological Review86(5):960-999.
89.
McCumberAndrewDavisAdam. 2022. “Elite Environmental Aesthetics: Placing Nature in a Changing Climate.” American Journal of Cultural Sociology. doi: https://doi.org/10.1057/s41290-022-00179-w
90.
MihaylovTodorNakovPreslav. 2019. “SemanticZ at SemEval-2016 Task 3: Ranking Relevant Answers in Community Question Answering Using Semantic Similarity Based on Fine-Tuned Word Embeddings.” arXiv 1911.08743.
91.
MikolovTomasYihWen-tauZweigGeoffrey. 2013a. “Linguistic Regularities in Continuous Space Word Representations.” Pp. 746-51 in Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Atlanta, Georgia. Association for Computational Linguistics.
92.
MikolovTomasYihWen-tauZweigGeoffrey. 2013b. “Linguistic Regularities in Continuous Space Word Representations.” Pp. 746-51 in Proceedings of the 2013 Conference of the NAACL.
93.
MilesMatthew B.HubermanMichael A.. 1994. Qualitative Data AnalysisThousand Oaks: SAGE.
NelsonLaura K. 2021. “Leveraging the Alignment Between Machine Learning and Intersectionality: Using Word Embeddings to Measure Intersectional Experiences of the Nineteenth Century U.S. South.” Poetics p. 101539.
98.
NelsonLaura K.BurkDerekKnudsenMarcelMcCallLeslie. 2021. “The Future of Coding: A Comparison of Hand-Coding and Three Types of Computer-Assisted Text Analysis Methods.” Sociological Methods & Research50:202-37.
99.
The New York Times. 1965. “Soviet Show Picketed in Ohio.” The New York Times.
100.
The New York Times. 1967. “Chicago Unit Sues to Fight Pollution of Lake Michigan.” The New York Times.
101.
The New York Times. 1972. “G.E. Resists War Protest; Honeywell Bars Arms Halt.” The New York Times.
102.
NothmanJoelQinHanminYurchakRoman. 2018. “Stop Word Lists in Free Open-Source Software Packages.” Pp. 7-12 in Proceedings of Workshop for NLP-OSS.
103.
OrnaghiAriannaAshElliottChenDaniel L.. 2019. “Stereotypes in High-Stakes Decisions: Evidence From US Circuit Courts.” Center for Law & Economics Working Paper Series2. doi:https://doi.org/10.3929/ethz-b-000376877
104.
OsgoodCharles EgertonSuciGeorge J.TannenbaumPercy H.. 1957. The Measurement of Meaning. Champaign, IL: University of Illinois Press.
105.
PangBoLeeLillianVaithyanathanShivakumar. 2002. “Thumbs Up?: Sentiment Classification Using Machine Learning Techniques.” Pp. 79-86 in Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing.
106.
Pardo-GuerraJuan PabloPahwaPrithviraj. 2022. “The Extended Computational Case Method: A Framework for Research Design.” Sociological Methods & Research51:1826-67.
107.
PaxtonPamelaVelascoKristopherResslerRobert W.. 2020. “Does Use of Emotion Increase Donations and Volunteers for Nonprofits?” American Sociological Review85:1051-83.
108.
PenningtonJeffreySocherRichardManningChristopher D.. 2014. “Glove: Global Vectors for Word Representation.” Pp. 1532-543 in Proceedings of the 2014 Conference on EMNLP.
109.
PiantadosiSteven T.. 2014. “Zipf’s Word Frequency Law in Natural Language.” Psychonomic Bulletin & Review21:1112-30.
110.
RheaultLudovicCochraneChristopher. 2019. “Word Embeddings for the Analysis of Ideological Placement in Parliamentary Corpora.” Pp. 1-22 Political Analysis: An Annual Publication of the Methodology Section of the American Political Science Association.
111.
RichieRussellZouWanlingBhatiaSudeep. 2019. “Distributional Semantic Representations Predict High-Level Human Judgment in Seven Diverse Behavioral Domains.” Pp. 2654-660 in Proceedings of the 41st Annual Conference of the Cognitive Science Society.
112.
RinkerTyler. 2022. “Lexicon: R Package.” https://CRAN.R-project.org/package=lexicon.
113.
RobertoFranzosi. 2021. “What’s in a Text? Bridging the Gap Between Quality and Quantity in the Digital Era.” Quality & Quantity55:1513-40.
114.
RodmanEmma. 2020. “A Timely Intervention: Tracking the Changing Meanings of Political Concepts With Word Vectors.” Political Analysis28:87-111.
115.
RodriguezPedroSpirlingArthur. 2021. “Word Embeddings: What Works, What Doesn’t, and How to Tell the Difference for Applied Research.” Journal of Politics.84(1):101-115. https://doi.org/10.1086/715162
116.
RodriguezPedroSpirlingArthurStewartBrandon. 2023. “Embedding Regression: Models for Context-Specific Description and Inference.” American Political Science Review117(4):1255-1274. https://doi.org/10.1017/S0003055422001228
117.
RollerStephenErkKatrin. 2016. “Relations Such as Hypernymy: Identifying and Exploiting Hearst Patterns in Distributional Vectors for Lexical Entailment.” arXiv 1605.05433.
118.
RossielloGaetanoBasilePierpaoloSemeraroGiovanni. 2017. “Centroid-Based Text Summarization Through Compositionality of Word Embeddings.” Pp. 12-21 in Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres. Association for Computational Linguistics.
119.
RubnerYossiTomasiCarloGuibasLeonidas J.. 1998. “A Metric for Distributions With Applications to Image Databases.” Pp. 59-66 in Sixth International Conference on Computer Vision. IEEE.
120.
SaltonGerardBuckleyChristopher. 1988. “Term-Weighting Approaches in Automatic Text Retrieval.” Information Processing & Management24:513-23.
121.
SchlenderThaleaSpanakisGerasimos. 2020. “‘Thy Algorithm Shalt Not Bear False Witness’: An Evaluation of Multiclass Debiasing Methods on Word Embeddings.” arXiv 2010.16228.
SelivanovDmitriyBickelManuelWangQing. 2020. “text2vec: Modern Text Mining Framework for R.”.
124.
SelivanovDmitriyBickelManuelWangQing. 2020.“text2vec: Modern Text Mining Framework for R.” https://CRAN.R-project.org/package=text2vec.
125.
SiaSuzannaDalmiaAyushMielkeSabrina J.. 2020. “Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics Too!” arXiv 2004.14914.
126.
SidorovGrigoriGelbukhAlexanderGómez-AdornoHelenaPintoDavid. 2014. “Soft Similarity and Soft Cosine Measure: Similarity of Features in Vector Space Model.” Computación y Sistemas18:491-504.
127.
SnefjellaBryorKupermanVictor. 2015. “Concreteness and Psychological Distance in Natural Language Use.” Psychological Science26(9):1449-1460. https://doi.org/10.1177/0956797615591771
128.
StoltzDustin S.TaylorMarshall A.. 2019. “Concept Mover’s Distance: Measuring Concept Engagement Via Word Embeddings in Texts.” Journal of Computational Social Science2:293-313.
StoltzDustin STaylorMarshall A.. 2022. "text2map: R Tools for Text Matrices." Journal of Open Source Software7(72):3741. https://doi.org/10.21105/joss
131.
StoltzDustin S.TaylorMarshall A.DudleyJennifer S. K.. 2023a. “The Dynamics of Collective Action Corpus [Data set].” https://doi.org/10.5281/ zenodo.8415049.
132.
StoltzDustin S.TaylorMarshall A.DudleyJennifer S. K.. 2023b“Replication Repository for ‘A Tool Kit for Relation Induction in Text Analysis’.” https://doi. org/10.5281/zenodo.8415049.
133.
StraussAnselm L.. 1987. Qualitative Analysis for Social ScientistsCambridge: Cambridge University Press.
134.
TausczikYla R.PennebakerJames W.. 2010. “The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods.” Journal of Language and Social Psychology29:24-54.
135.
TaylorMarshall A.StoltzDustin S.. 2020a. “Concept Class Analysis: A Method for Identifying Cultural Schemas in Texts.” Sociological Science7:544-69.
136.
TaylorMarshall A.StoltzDustin S.. 2020b. “Integrating Semantic Directions With Concept Mover’s Distance to Measure Binary Concept Engagement.” Journal of Computational Social Science4:231-242. https://doi.org/10.1007/s42001-020-00075-8
137.
UtsumiAkira. 2020. “Exploring What Is Encoded in Distributional Word Vectors: A Neurobiologically Motivated Analysis.” Cognitive Science44:e12844.
138.
ValentiniFranciscoRosatiGermánSlezakDiego FernandezAltszylerEdgar. 2023. “The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word Embeddings.”.
139.
van DongenStijnEnrightAnton J.. 2012. “Metric Distances Derived From Cosine Similarity and Pearson and Spearman Correlations.” arXiv 1208.3145.
140.
van LoonAustinGiorgiSalvatoreWillerRobbEichstaedtJohannes. 2022. “Negative Associations in Word Embeddings Predict Anti-Black Bias Across Regions—But Only Via Name Frequency.” Proceedings of the International AAAI Conference on Web and Social Media16:1419-24.
141.
VoyerAndreaKlineZachary D.DantonMadison. 2022a. “Symbols of Class: A Computational Analysis of Class Distinction-Making Through Etiquette, 1922-2017.” Poetics p. 101734.
142.
VoyerAndreaKlineZachary D.DantonMadisonVolkovaTatiana. 2022b. "From Strange to Normal: Computational Approaches to Examining Immigrant Incorporation Through Shifts in the Mainstream.” Sociological Methods & Research51(4):1540-1579.
143.
VylomovaEkaterinaRimellLauraCohnTrevorBaldwinTimothy. 2016. “Take and Took, Gaggle and Goose, Book and Read: Evaluating the Utility of Vector Differences for Lexical Relation Learning.” Pp. 1671-682 in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.
144.
WangDan J.SouleSarah A.. 2012. “Social Movement Organizational Collaboration: Networks of Learning and the Diffusion of Protest Tactics, 1960-1995.” American Journal of Sociology117:1674-722.
145.
WietingJohnBansalMohitGimpelKevinLivescuKaren. 2016. “Charagram: Embedding Words and Sentences via Character n-Grams.” arXiv 1607.02789.
146.
WilsonDavid S.. 1988. “Deaf Actress’s Use of Speech Proves Divisive Among Peers.” The New York Times.
147.
WoodMichael Lee. 2023. “Measuring Cultural Diversity in Text With Word Counts.” Social Psychology Quarterly. Online First. 10.1177/01902725231194356
148.
YıldırımSavaşYıldızTuğba. 2018. “Learning Turkish Hypernymy Using Word Embeddings.” International Journal of Computational Intelligence Systems11:371-83.
149.
YuShuiyuanXuChunshanLiuHaitao. 2018. “Zipf’s Law in 50 Languages.” arXiv 1807.01855.
150.
ZhangXuchaoZongBoChengWeiNiJingchaoLiuYanchiChenHaifeng. 2021. “Unsupervised Concept Representation Learning for Length-Varying Text Similarity.” Pp. 5611-620 in Proceedings of the 2021 Conference of the NAACL.