I am a Data Scientist working under Dr. Brenda Curtis in the Technology and Translational Research Unit. I received my PhD in Computer Science at the University of Pennsylvania, working under H. Andrew Schwartz and Lyle Ungar. My primary research interest is community-centered NLP: developing geospatial NLP methods to measure relationships between individuals and their communities using social media language. I'm also interested in machine learning applications to substance use and recovery.

Finally, I am a father, a first-generation / low-income (FGLI) student, and a community college graduate. Please reach out if you'd like to talk about similar PhD experiences.

sgiorgi (at) sas (dot) upenn (dot) edu


Key language markers of depression on social media depend on race.
Sunny Rai, Elizabeth C. Stade, Salvatore Giorgi, Ashley Francisco, Lyle H. Ungar, Brenda Curtis, and Sharath C. Guntuku. PNAS, 2024.
Lived Experience Matters: Automatic Detection of Stigma on Social Media Toward People Who Use Substances.
Salvatore Giorgi, Douglas Bellew, Daniel Roy Sadek Habib, Garrick Sherman, Joao Sedoc, Chase Smitterberg, Amanda Devoto, McKenzie Himelein-Wachowiak, and Brenda Curtis. ICWSM, 2024.
Filling in the White Space: Spatial Interpolation with Gaussian Processes and Social Media Data.
Salvatore Giorgi, Johannes C. Eichstaedt, Daniel Preotiuc-Pietro, Jacob R. Gardner, H. Andrew Schwartz, Lyle H. Ungar. CRESP, 2023.
PDF Bib Data Code
A Linguistic Analysis of Dehumanization toward Substance Use across Three Decades of News Articles.
Salvatore Giorgi, Daniel Habib, Douglas Bellew, Garrick Sherman, Brenda Curtis. Frontiers in Public Health-Substance Use Disorders and Behavioral Addictions, 2023.
PDF Code
AWARE-TEXT: An Android Package for Mobile Phone Based Text Collection and On-Device Processing.
Salvatore Giorgi*, Garrick Sherman*, Douglas Bellew, Sharath Chandra Guntuku, Lyle Ungar, Brenda Curtis. NLP-OSS at EMNLP, 2023.
PDF Bib Code
Leveraging AI to predict substance use disorder treatment outcomes.
Salvatore Giorgi and Brenda Curtis. Neuropsychopharmacology, 2023.
"I Slept Like a Baby": Using Human Traits To Characterize Deceptive ChatGPT and Human Text.
Salvatore Giorgi*, David M. Markowitz*, Nikita Soni, Vasudha Varadarajan, Siddharth Mangalik and H. Andrew Schwartz. IACT at SIGIR, 2023.
Smartphone sensor data estimate alcohol craving in a cohort of patients with alcohol-associated liver disease and alcohol use disorder.
Tiffany Wu, Garrick Sherman, Salvatore Giorgi, Priya Thanneeru, Lyle H. Ungar, Patrick S. Kamath, Douglas A. Simonetto, Brenda L. Curtis, and Vijay H. Shah. Hepatology Communications, 2023.
Extended impact of the COVID-19 pandemic: Trajectories of mental health and substance use among U.S. adults, September 2020–August 2021.
Xiangyu Tao, Tingting Liu, Salvatore Giorgi, Celia B. Fisher, and Brenda Curtis. Drug and Alcohol Dependence Reports, 2023.
Findings of WASSA 2023 Shared Task on Empathy, Emotion and Personality Detection in Conversation and Reactions to News Articles.
Valentin Barriere, Joao Sedoc, Shabnam Tafreshi, and Salvatore Giorgi. WASSA, 2023.
Characterizing Empathy and Compassion Using Computational Linguistic Analysis.
David B. Yaden*, Salvatore Giorgi*, Matthew Jordan, Anneke Buffone, Johannes C. Eichstaedt, H. Andrew Schwartz, Lyle Ungar, and Paul Bloom. Emotion 2023.
PDF Supplement Bib Data
Predicting U.S. County Opioid Poisoning Mortality From Multi-Modal Social Media and Psychological Self-Report Data.
Salvatore Giorgi, David B. Yaden, Johannes C. Eichstaedt, Lyle H. Ungar, H. Andrew Schwartz, Amy Kwarteng, Brenda Curtis. Scientific Reports 2023.
PDF Supplement Bib Data
Author as Character and Narrator: Deconstructing Personal Narratives from the r/AmITheAsshole Reddit Community.
Salvatore Giorgi, Ke Zhao, Alexander H. Feng, Lara J. Martin. International Conference on Web and Social Media (ICWSM) 2023.
PDF Bib Data
Different Affordances on Facebook and SMS Text Messaging Do Not Impede Generalization of Language-Based Predictive Models.
Tingting Liu*, Salvatore Giorgi*, Xiangyu Tao, Sharath Chandra Guntuku, Douglas Bellew, Brenda Curtis, Lyle Ungar. International Conference on Web and Social Media (ICWSM) 2023.
PDF Supplement Bib Code
Towards Well-Being Measurement with Social Media Across Space, Time and Cultures: Three Generations of Progress.
Oscar Kjell, Salvatore Giorgi, H. Andrew Schwartz, Johannes C. Eichstaedt. World Happiness Report (Chapter 5), 2023.
AI-based analysis of social media language predicts addiction treatment dropout at 90 days.
Brenda Curtis, Salvatore Giorgi, Lyle Ungar, Huy Vu, David Yaden, Tingting Liu, Kenna Yadeta, H. Andrew Schwartz. Neuropsychopharmacology 2023.
PDF Supplement Bib
Measuring disadvantage: A systematic comparison of United States small-area disadvantage indices.
Sophia Lou*, Salvatore Giorgi*, Tingting Liu, Johannes C Eichstaedt, Brenda Curtis. Health and Place 2023.
PDF Supplement Bib Data
Opioid death projections with AI-based forecasts using social media language.
Matthew Matero, Salvatore Giorgi, Brenda Curtis, Lyle H Ungar, H Andrew Schwartz. NPJ Digital Medicine 2023.
Role of the media in promoting the dehumanization of people who use drugs.
Daniel Roy Sadek Habib, Salvatore Giorgi, Brenda Curtis. The American Journal of Drug and Alcohol Abuse 2023.
The Text-Package: An R-Package for Analyzing and Visualizing Human Language Using Natural Language Processing and Transformers.
Oscar Kjell, Salvatore Giorgi, H. Andrew Schwartz. Psychological Methods 2023.
Differences in mental health and alcohol use across profiles of COVID-19 disruptions.
Aaliyah Gray, Tingting Liu, Salvatore Giorgi, Celia B Fisher, Brenda Curtis. Alcohol and Alcoholism 2023.
PDF Bib Supplement
Depression and anxiety on Twitter during the COVID-19 stay-at-home period in 7 major US cities.
Danielle Levanti, Rebecca N Monastero, Mohammadzaman Zamani, Johannes C Eichstaedt, Salvatore Giorgi, H Andrew Schwartz, Jaymie R Meliker. AJPM Focus 2023.
A Cross-Modal Study of Pain Across Communities in the United States.
Arnav Aggarwal, Sunny Rai, Salvatore Giorgi, Shreya Havaldar, Garrick Sherman, Juhi Mittal, Sharath Chandra Guntuku. SocialNLP 2023.
Pandemic distress associated with segregation and social stressors.
Rodman Turpin, Salvatore Giorgi, Brenda Curtis. Frontiers in Public Health 2023.
PDF Bib Data
COVID-related social determinants of substance use disorder among diverse US racial ethnic groups.
Xiangyu Tao, Tingting Liu, Celia B Fisher, Salvatore Giorgi, Brenda Curtis. Social Science & Medicine 2023.
Correcting Sociodemographic Selection Biases for Population Prediction from Social Media.
Salvatore Giorgi, Veronica Lynn, Keshav Gupta, Farhan Ahmed, Sandra Matz, Lyle Ungar, and H. Andrew Schwartz. International Conference on Web and Social Media (ICWSM) 2022.
PDF Supplement Bib Data Code
Twitter Corpus of the #BlackLivesMatter Movement And Counter Protests: 2013 to 2021.
Salvatore Giorgi, Sharath Chandra Guntuku, McKenzie Himelein-Wachowiak, Amy Kwarteng, Sy Hwang, Muhammad Rahman, and Brenda Curtis. International Conference on Web and Social Media (ICWSM) 2022.
PDF Bib Data Code
Nonsuicidal Self-Injury and Substance Use Disorders: A Shared Language of Addiction.
Salvatore Giorgi, McKenzie Himelein-Wachowiak, Daniel Habib, Lyle Ungar, and Brenda Curtis. Workshop on Computational Linguistics and Clinical Psychology (CLPsych) 2022.
PDF Supplement Bib
Daily diary study of loneliness, alcohol, and drug use during the COVID-19 Pandemic.
Elise Bragard, Salvatore Giorgi, Paul Juneau, and Brenda Curtis. Alcoholism: Clinical and Experimental Research (ACER) 2022.
Linguistic predictors from Facebook postings of substance use disorder treatment retention versus discontinuation.
Tingting Liu, Salvatore Giorgi, Kenna Yadeta, H. Andrew Schwartz, Lyle H. Ungar, and Brenda Curtis. The American Journal of Drug and Alcohol Abuse 2022.
A Human-Centered Hierarchical Framework for Dialogue System Construction and Evaluation.
Salvatore Giorgi, Farhan Ahmed, Lyle Ungar, and H. Andrew Schwartz. The Tenth Dialog System Technology Challenge at AAAI (DSTC10) 2022.
PDF Poster Bib
Negative Associations in Word Embeddings Predict anti-Black Bias Across Regions--but only via Name Frequency.
Austin van Loon, Salvatore Giorgi, Johannes Eichstaedt, and Robb Willer. International Conference on Web and Social Media (ICWSM) 2022.
Getting "clean" from nonsuicidal self-injury: Experiences of addiction on the subreddit r/selfharm.
McKenzie Himelein-Wachowiak, Salvatore Giorgi, Amy Kwarteng, Destiny Schriefer, Chase Smitterberg, Kenna Yadeta, Elise Bragard, Amanda Devoto, Lyle Ungar, and Brenda Curtis. Journal of Behavioral Addictions 2022.
PDF Bib Press Preregistration
Modeling Latent Dimensions of Human Beliefs.
Huy Vu, Salvatore Giorgi, Jeremy D. W. Clifton, Niranjan Balasubramanian, and H. Andrew Schwartz. International Conference on Web and Social Media (ICWSM) 2022.
Using Facebook language to predict and describe excessive alcohol use.
Rupa Jose, Matthew Matero, Garrick Sherman, Brenda Curtis, Salvatore Giorgi, H. Andrew Schwartz, and Lyle Ungar. Alcoholism: Clinical and Experimental Research (ACER) 2022.
Feasibility of Mobile Health and Social Media–Based Interventions for Young Adults With Early Psychosis and Clinical Risk for Psychosis: Survey Study.
Olivia Franco, Monica Calkins, Salvatore Giorgi, Lyle H. Ungar, Raquel Gur, Christian Kohler, and Sunny Tang. JMIR Formative Research 2022.
Regional personality assessment through social media language.
Salvatore Giorgi, Khoa Le Nguyen, Johannes C. Eichstaedt, Margaret L. Kern, David B. Yaden, Michal Kosinski, Martin E. P. Seligman, Lyle H. Ungar, H. Andrew Schwartz, and Gregory Park. Journal of Personality 2021.
PDF Data Bib
Characterizing Social Spambots by their Human Traits.
Salvatore Giorgi, Lyle H. Ungar, and H. Andrew Schwartz. Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
PDF Bib Press Press
Well-Being Depends on Social Comparison: Hierarchical Models of Twitter Language Suggest That Richer Neighbors Make You Less Happy.
Salvatore Giorgi, Sharath Chandra Guntuku, Johannes C. Eichstaedt, Claire Pajot, H. Andrew Schwartz, and Lyle H. Ungar. International Conference on Web and Social Media (ICWSM) 2021.
PDF Supplement Data Bib
Discovering Black Lives Matter Events in the United States: Shared Task 3, CASE 2021.
Salvatore Giorgi, Vanni Zavarella, Hristo Tanev, Nicolas Stefanovitch, Sy Hwang, Hansi Hettiarachchi, Tharindu Ranasinghe, Vivek Kalyan, Paul Tan, Shaun Tan, Martin Andrews, Tiancheng Hu, Niklas Stoehr, Francesco Ignazio Re, Daniel Vegh, Dennis Atzenhofer, Brenda Curtis, and Ali Hürriyetoğlu. Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE) 2021.
PDF Data Bib
The emotional and mental health impact of the murder of George Floyd on the US population.
Johannes C. Eichstaedt, Garrick T. Sherman, Salvatore Giorgi, Steven O. Roberts, Megan E. Reynolds, Lyle H. Ungar, and Sharath Chandra Guntuku. Proceedings of the National Academy of Sciences (PNAS) 2021.
PDF Supplement Data Bib
Loneliness and Daily Alcohol Consumption During the COVID-19 Pandemic.
Elise Bragard, Salvatore Giorgi, Paul Juneau, Brenda L. Curtis. Alcohol and Alcoholism 2021.
Bots and misinformation spread on social media: A mixed scoping review with implications for COVID-19.
McKenzie Himelein-Wachowiak, Salvatore Giorgi, Amanda Devoto, Muhammad Rahman, Lyle Ungar, H. Andrew Schwartz, David H. Epstein, Lorenzo Leggio, and Brenda Curtis. Journal of Medical Internet Research (JMIR) 2021.
Beyond Beliefs: Multidimensional Aspects of Religion and Spirituality in Language.
David B. Yaden, Salvatore Giorgi, Margaret L. Kern, Alejandro Adler, Lyle H. Ungar, Martin E. P. Seligman, and Johannes C. Eichstaedt. Psychology of Religion and Spirituality 2021.
COVID-Related Victimization, Racial Bias and Employment and Housing Disruption Increase Mental Health Risk Among U.S. Asian, Black and Latinx Adults.
Celia B. Fisher, Xiangyu Tao, Tingting Liu, Salvatore Giorgi, and Brenda L. Curtis. Frontiers in Public Health 2021.
Understanding Weekly COVID-19 Concerns through Dynamic Content-Specific LDA Topic Modeling.
Mohammadzaman Zamani, H. Andrew Schwartz, Johannes Eichstaedt, Sharath Chandra Guntuku, Adithya Virinchipuram Ganesan, Sean Clouston and Salvatore Giorgi. NLP+CSS 2020.
PDF Data Bib
Information-seeking vs. sharing: Which explains regional health? An analysis of Google Search and Twitter trends.
Kokil Jaidka, Johannes Eichstaedt, Salvatore Giorgi, H. Andrew Schwartz, and Lyle H. Ungar. Telematics and Informatics 2020.
Closed- and Open-Vocabulary Approaches to Text Analysis: A Review, Quantitative Comparison, and Recommendations.
Johannes Eichstaedt, Margaret L. Kern, David B. Yaden, H.A. Schwartz, Salvatore Giorgi, Gregory Park, Courtney A. Hagan, Victoria Tobolsky, Laura K. Smith, Anneke Buffone, Jonathan Iwry, Martin E. P. Seligman and Lyle H. Ungar. Psychological Methods 2020.
Estimating geographic subjective well-being from Twitter: A comparison of dictionary and data-driven language methods.
Kokil Jaidka, Salvatore Giorgi, H. Andrew Schwartz, Margaret L. Kern, Lyle H. Ungar, and Johannes C. Eichstaedt. Proceedings of the National Academy of Sciences 2020.
PDF Supplement Data Bib
Quantifying Community Characteristics of Maternal Mortality.
Rediet Abebe*, Salvatore Giorgi*, Anna Tedijanto, Anneke Buffone, and H. Andrew Schwartz. The Web Conference 2020, IC2S2 2020.
PDF Data Bib
Cultural Differences in Tweeting about Drinking Across the U.S.
Salvatore Giorgi, David B. Yaden, Johannes C. Eichstaedt, Robert D. Ashford, Anneke Buffone, H. Andrew Schwartz and Lyle Ungar. International Journal of Environmental Research and Public Health 2020.
Exploring Substance Use Tweets of Youth in the United States: Mixed Methods Study.
Robin Stevens, Bridgette Brawner, Elissa Kranzler, Salvatore Giorgi, Elizabeth Lazarus, Maramawit Abera, Sarah Huang and Lyle Ungar. JMIR Public Health and Surveillance 2020.
Digital recovery networks: Characterizing user participation, engagement, and outcomes of a novel recovery social network smartphone application.
Robert D. Ashford, Salvatore Giorgi, Beau Mann, Chris Pesce, Lon Sherritt, Lyle Ungar and Brenda Curtis. Journal of Substance Abuse Treatment 2019.
Tweet Classification without the Tweet: An Empirical Examination of User versus Document Attributes.
Veronica Lynn, Salvatore Giorgi, Niranjan Balasubramanian and H. Andrew Schwartz. NLP+CSS 2019.
PDF Poster Bib
Suicide Risk Assessment with Multi-level Dual-Context Language and BERT.
Matthew Matero, Akash Idnani, Youngseo Son, Salvatore Giorgi, Huy Vu, Mohammad Zamani, Parth Limbachiya, Sharath Chandra Guntuku and H. Andrew Schwartz. CLPsych 2019.
PDF Code Poster Bib
The Remarkable Benefit of User-Level Aggregation for Lexical-based Population-Level Predictions.
Salvatore Giorgi, Daniel Preotiuc-Pietro, Anneke Buffone, Daniel Rieman, Lyle H. Ungar and H. Andrew Schwartz. EMNLP 2018.
PDF Supplement Data Poster Bib
Residualized Factor Adaptation for Community Social Media Prediction Tasks.
Mohammadzaman Zamani, H. Andrew Schwartz, Veronica Lynn, Salvatore Giorgi and Niranjan Balasubramanian. EMNLP 2018.
PDF Data Bib
Primal World Beliefs.
Jeremy Clifton, Joshua D. Baker, Crystal L. Park, David B. Yaden, Alicia Clifton, Paolo Terni, Jessica L. Miller, Guang Zeng, Salvatore Giorgi, H. Andrew Schwartz and Martin E. P. Seligman. Psychological Assessment 2018.
PDF Supplement Bib
Current and Future Psychological Health Prediction using Language and Socio-Demographics of Children for the CLPysch 2018 Shared Task.
Sharath Chandra Guntuku, Salvatore Giorgi and Lyle H. Ungar. CLPsych 2018.
More Evidence that Twitter Language Predicts Heart Disease: A Response and Replication.
Johannes Eichstaedt, H. Andrew Schwartz, Salvatore Giorgi, Margaret L. Kern, Gregory Park, Maarten Sap, Darwin R. Labarthe, Emily E. Larson, Martin Seligman, and Lyle H. Ungar. PsyArXiv 2018.
PDF Data Bib
Can Twitter be used to predict county excessive alcohol consumption rates?
Brenda Curtis*, Salvatore Giorgi*, Anneke E. K. Buffone, Lyle H. Ungar, Robert D. Ashford, Jessie Hemmons, Dan Summers, Casey Hamilton, H. Andrew Schwartz. PLOS ONE 2018.
PDF Data Bib
Modeling and Visualizing Locus of Control with Facebook Language.
Kokil Jaidka, Anneke Buffone, Salvatore Giorgi, Johannes Eichstaedt, Masoud Rouhizadeh, and Lyle Ungar. Proceedings of the International AAAI Conference on Web and Social Media 2018.
DLATK: Differential Language Analysis ToolKit.
H. Andrew Schwartz, Salvatore Giorgi, Maarten Sap, Patrick Crutchley, Johannes C. Eichstaedt, and Lyle Ungar. EMNLP 2017.
PDF Code Poster Bib
On the Distribution of Lexical Features at Multiple Levels of Analysis.
Fatemeh Almodaresi, Lyle Ungar, Vivek Kulkarni, M. Zakeri, Salvatore Giorgi, and H. Andrew Schwartz. ACL 2017.
Recognizing Pathogenic Empathy in Social Media.
Muhammad Abdul-Mageed, Anneke Buffone, Hao Peng, Salvatore Giorgi, Johannes Eichstaedt and Lyle Ungar. ICWSM 2017.
Does well-being translate on Twitter? A comparative evaluation of English and Spanish well-being lexica.
Laura Smith, Salvatore Giorgi, Rishi Solanki, Johannes Eichstaedt, H. Andrew Schwartz, Muhammad Abdul-Mageed, Anneke Buffone and Lyle Ungar. EMNLP 2016.
PDF Data Poster Bib
Real men don't say "cute": Using automatic language analysis to isolate inaccurate aspects of stereotypes.
Jordan Carpenter, Daniel Preotiuc-Pietro, Lucie Flekova, Salvatore Giorgi, Courtney Hagan, Margaret Kern, Anneke Buffone, Lyle Ungar and Martin Seligman. SPPS 2016.
PDF Supplement Bib
Studying the Dark Triad of Personality using Twitter Behavior.
Daniel Preotiuc-Pietro, Jordan Carpenter, Salvatore Giorgi and Lyle Ungar. CIKM 2016.
Analyzing Biases in Human Perception of User Age and Gender from Text.
Lucie Flekova, Jordan Carpenter, Salvatore Giorgi, Lyle Ungar, and Daniel Preotiuc-Pietro. ACL 2016.
PDF Poster Bib
Analyzing crowdsourced assessment of user traits through Twitter posts.
Lucie Flekova, Daniel Preotiuc-Pietro, Jordan Carpenter, Salvatore Giorgi, and Lyle Ungar. HCOMP 2015.
PDF Supplement Poster Bib
Design and Evaluation of a Web-based Virtual Open Laboratory Teaching Assistant (VOLTA) for Circuits Laboratory.
Firdous Saleheen, Salvatore Giorgi, Zachary Smith, Joseph Picone and Chang-Hee Won. ASEE Annual Conference and Exposition 2015.
Adaptive Neural Replication and Resilient Control Despite Malicious Attacks.
Salvatore Giorgi, Firdous Saleheen, Frank Ferrese and Chang-Hee Won. 5th International Symposium on Resilient Control Systems 2012.

* equal contribution


Black Lives Matter Twitter Corpus
A data set of 41.8 million tweets from 10 million users, each containing at least one of the following keywords: BlackLivesMatter, AllLivesMatter, and BlueLivesMatter.
Regional Personality Estimates
Twitter-based estimates of personality (openness, conscientiousness, extroversion, agreeableness, and neuroticism) for 2,000 U.S. counties.
County Tweet Lexical Bank
County-level word and topic loadings derived from a 10% Twitter sample from 2009-2015. Anonymized linguistic features extracted from over 1.5 billion English tweets mapped to U.S. counties.


Differential Language Analysis ToolKit (DLATK)
DLATK is an end-to-end human text analysis package, specifically suited for social media and social scientific applications. It is written in Python 3 and developed by the World Well-Being Project at the University of Pennsylvania and Stony Brook University.
text (R package)
R package for transforming text to state-of-the-art word embeddings that are ready to be used for downstream tasks.
County Interpolation
Web interface for interpolating spatial data via Gaussian Processes.