Using language algorithms to detect fake online profiles that deceive other users
Many adult content websites incorporate social networking features. Although these are popular, they raise significant challenges, including the potential for users to “catfish”, i.e., to create fake profiles to deceive other users. This paper takes an initial step towards automated catfish detection. We explore the characteristics of the different age and gender groups, identifying a number of distinctions. Building on this, we train models on user profiles and comments, using specially verified profiles as ground truth. Applying our models to estimate the age and gender of unverified profiles, we find that 38% of profiles are likely lying about their age and 25% are likely lying about their gender. We find that women have a greater propensity to catfish than men. Further, whereas female catfish choose from a wide age range, male catfish consistently claim to be younger. Our work has notable implications for operators of such online social networks, as well as for users who may worry about interacting with catfish.
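As a rough illustration of the idea described above (train on verified profiles, then flag unverified profiles whose claimed attribute disagrees with the model's estimate), here is a minimal sketch. It is not the method used in the paper: the Naive Bayes classifier is a simple stand-in, and the function names, the group labels, the placeholder tokens, and the 0.8 confidence threshold are all hypothetical choices made for this example.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(texts, labels):
    """Fit a multinomial Naive Bayes model (word counts per label)."""
    word_counts = defaultdict(Counter)  # label -> word -> count
    label_counts = Counter(labels)      # label -> number of documents
    vocab = set()
    for text, label in zip(texts, labels):
        tokens = text.lower().split()
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return word_counts, label_counts, vocab


def predict_proba(model, text):
    """Return a dict mapping each label to its posterior probability."""
    word_counts, label_counts, vocab = model
    total_docs = sum(label_counts.values())
    log_scores = {}
    for label, n_docs in label_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)  # log prior
        for token in text.lower().split():
            if token in vocab:  # skip words never seen in training
                # Add-one (Laplace) smoothed word likelihood.
                count = word_counts[label][token] + 1
                score += math.log(count / (total_words + len(vocab)))
        log_scores[label] = score
    # Normalise in a numerically stable way.
    max_log = max(log_scores.values())
    exp_scores = {l: math.exp(s - max_log) for l, s in log_scores.items()}
    norm = sum(exp_scores.values())
    return {l: v / norm for l, v in exp_scores.items()}


def flag_mismatches(model, profiles, threshold=0.8):
    """Flag profiles whose claimed label differs from a confident prediction."""
    flagged = []
    for claimed, text in profiles:
        probs = predict_proba(model, text)
        predicted = max(probs, key=probs.get)
        if predicted != claimed and probs[predicted] >= threshold:
            flagged.append((claimed, predicted, text))
    return flagged


# Toy, entirely synthetic data: placeholder tokens and placeholder groups.
verified_texts = ["alpha beta alpha", "alpha gamma",
                  "delta epsilon", "delta delta zeta"]
verified_labels = ["group_a", "group_a", "group_b", "group_b"]

# Unverified profiles as (claimed label, profile text) pairs.
unverified = [("group_a", "alpha alpha beta"),
              ("group_a", "delta zeta delta")]

model = train_naive_bayes(verified_texts, verified_labels)
flagged = flag_mismatches(model, unverified)
print(flagged)
```

The threshold means a profile is flagged only when the model is fairly confident in a label that contradicts the claimed one. A flag is a prompt for closer inspection, not proof of deception, since any such classifier makes errors.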
Paper to appear in IEEE/ACM ASONAM 2017 https://arxiv.org/abs/1705.06530
Dr Walid Magdy, University of Edinburgh, School of Informatics.