EEMCS

Home > Publications
Home University of Twente
Education
Research
Prospective Students
Jobs
Publications
Intranet (internal)
 
 Nederlands
 Contact
 Search
 Organisation

EEMCS EPrints Service


24093 TweetGenie: automatic age prediction from tweets
Home Policy Brochure Browse Search User Area Contact Help

Nguyen, Dong-Phuong and Gravel, Rilana and Trieschnigg, R.B. and Meder, Theo (2013) TweetGenie: automatic age prediction from tweets. ACM SIGWEB Newsletter, 4 (Autumn). 4. ISSN 1931-1745

Full text available as:

PDF
- Univ. of Twente only
375 Kb

Official URL: http://dx.doi.org/10.1145/2528272.2528276

Exported to Metis

Abstract

A person’s language use reveals much about the person’s social identity, which is based on the social categories a person belongs to including age and gender. We discuss the development of TweetGenie, a computer program that predicts the age of Twitter users based on their language use. We explore age prediction in three different ways: classifying users into age categories, by life stages, and predicting their exact age. An automatic system achieves better performance than humans on these tasks. Both humans and the automatic systems tend to underpredict the age of older people. We find that most linguistic changes occur when people are young, and that after around 30 years the studied variables show little change, making it difficult to predict the ages of older Twitter users.

Item Type:Article
Research Group:EWI-HMI: Human Media Interaction
Research Program:CTIT-NICE: Natural Interaction in Computer-mediated Environments
Research Project:FACT: Folktales As Classifiable Texts
Uncontrolled Keywords:language, age, twitter, prediction, social media, sociolinguistics
ID Code:24093
Status:Published
Deposited On:07 January 2014
Refereed:No
International:Yes
More Information:statisticsmetis

Export this item as:

To request a copy of the PDF please email us request copy

To correct this item please ask your editor

Repository Staff Only: edit this item