Filter by type:

Sort by year:

Harvesting Social Signals to Inform Peace Processes Implementation and Monitoring

Aastha Nigam, Henry K. Dambanemuya, Madhav Joshi, Nitesh Chawla
Journal Big Data Journal, 2017

Abstract

Peace processes are complex, protracted, and contentious involving significant bargaining and compromising among various societal and political stakeholders. In civil war terminations, it is pertinent to measure the pulse of the nation to ensure that the peace process is responsive to citizens' concerns. Social media yields tremendous power as a tool for dialogue, debate, organization, and mobilization thereby adding more complexity to the peace process. Using Colombia's final peace agreement and national referendum as a case study, we investigate the influence of two important indicators: inter-group polarization and public sentiment towards the peace process. We present a detailed linguistic analysis to detect inter-group polarization and a predictive model that leverages tweet structure, content, and user-based features to predict public sentiment towards the Colombian peace process. We demonstrate that had pro-accord stakeholders leveraged public opinion from social media the outcome of the Colombian referendum could have been different.

Harvesting Social Signals to Inform Peace Processes Implementation and Monitoring

Aastha Nigam, Henry K. Dambanemuya, Madhav Joshi, Nitesh Chawla
Poster Grace Hopper Conference (GHC) 2017, Orlando - FL & Women in Statistics and Data Science, October 2017, La Jolla - CA

Abstract

Peace processes are complex, protracted, and contentious involving significant bargaining and compromising among various societal and political stakeholders. In civil war terminations, it is pertinent to measure the pulse of the nation to ensure that the peace process is responsive to citizens' concerns. Social media yields tremendous power as a tool for dialogue, debate, organization, and mobilization thereby adding more complexity to the peace process. Using Colombia's final peace agreement and national referendum as a case study, we investigate the influence of two important indicators: inter-group polarization and public sentiment towards the peace process. We present a detailed linguistic analysis to detect inter-group polarization and a predictive model that leverages tweet structure, content, and user-based features to predict public sentiment towards the Colombian peace process. We demonstrate that had pro-accord stakeholders leveraged public opinion from social media the outcome of the Colombian referendum could have been different.

A Content-Driven Framework for Online User Understanding

Aastha Nigam, Nitesh Chawla
Book Chapter Social Media Analytics: Advances and Applications, 2017

Connecting the Dots to Infer Followers' Topical Interest on Twitter

Aastha Nigam, Salvador Aguinaga, Nitesh V. Chawla
Paper IEEE International Conference on Behavioral, Economic, and Socio-Cultural Computing, 2016

Abstract

Twitter provides a platform for information sharing and diffusion, and has quickly emerged as a mechanism for organizations to engage with their consumers. A driving factor for engagement is providing relevant and timely content to users. We posit that the engagement via tweets offers a good potential to discover user interests and leverage that information to target specific content of interest. To that end, we have developed a framework that analyzes tweets to identify the interests of current followers and leverages topic models to deliver a personalized topic profile for each user. We validated our framework by partnering up with a local media company and analyzing the content gap between them and their followers. We also developed a mobile application that incorporates the proposed framework.

Link Prediction in a Semi-bipartite Network for Recommendation

Aastha Nigam, Nitesh V. Chawla
Paper 8th Asian Conference, ACIIDS 2016, Da Nang, Vietnam, March 14-16, 2016, Proceedings, Part II, Pages 127-135

Abstract

There is an increasing trend amongst users to consume information from websites and social media. With the huge influx of content it becomes challenging for the consumers to navigate to topics or articles that interest them. Particularly in health care, the content consumed by a user is controlled by various factors such as demographics and lifestyle. In this paper, we use a semi-bipartite network model to capture the interactions between users and health topics that interest them. We use a supervised link prediction approach to recommend topics to users based on their past reading behavior and contextual data associated to a user such as demographics.

Social Context Perception for Mobile Robots

Aastha Nigam, Laurel D. Riek
Paper In Proc. of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2015

Abstract

As robots enter human spaces, unique perception challenges are emerging. Sensing human activity, adapting to highly dynamic environments, and acting coherently and contingently is challenging when robots transition from structured environments to human-centric ones. We approach this problem by employing context-based perception, a biologically-inspired, low-cost approach to sensing that leverages noisy, global features. Across several months, our mobile robot collected real-world, multimodal data from multi-use locations; where the same space might be used for many different activities. We then ran a series of unimodal and multimodal classification experiments. We successfully classified several aspects of situational context from noisy data, and, to our knowledge are the first group to do so. This work represents an important step toward enabling robots that can readily leverage context to solve perceptual tasks.

Disguise Detection and Face Recognition using Visible and Thermal Images

Tejas I. Dhamecha, Aastha Nigam, Richa Singh and Mayank Vatsa
Paper 6th IAPR International Conference on Biometrics, June, 2013

Abstract

Face verification, though for humans seems to be an easy task, is a long-standing research area. With challenging covariates such as disguise or face obfuscation, automatically verifying the identity of a person is assumed to be very hard. This paper explores the feasibility of face verification under disguise variations using multi-spectrum (visible and thermal) face images. We propose a framework, termed as Anavrta, which classifies the local facial regions of both visible and thermal face images into biometric (regions without disguise) and non-biometric (regions with disguise) classes. The biometric patches are then used for facial feature extraction and matching. The performance of the algorithm is evaluated on the IIITD In and Beyond Visible Spectrum Disguise database that is prepared by the authors and contains images pertaining to 75 subjects with different kinds of disguise variations. The experimental results suggest that the proposed framework improves the performance compared to existing algorithms, however there is a need for more research to address this important covariate.

Large-Scale Learning with AdaGrad on Spark

Asmelash Teka Hadgu, Aastha Nigam and Ernesto Diaz-Aviles
Poster Big Data (Big Data), 2015 IEEE International Conference on, Oct. 29 2015-Nov. 1 2015, pp 2828 - 2830

Abstract

Stochastic Gradient Descent (SGD) is a simple yet very efficient online learning algorithm for optimizing convex (and often non-convex) functions and one of the most popular stochastic optimization methods in machine learning today. One drawback of SGD is that it is sensitive to the learning rate hyper-parameter. The Adaptive Sub-gradient Descent, AdaGrad, dynamically incorporates knowledge of the geometry of the data observed in earlier iterations to calculate a different learning rate for every feature. In this work, we implement a distributed version of AdaGrad for large-scale machine learning tasks using Apache Spark. Apache Spark is a fast cluster computing engine that provides similar scalability and fault tolerance properties to MapReduce, but in contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's multi-stage in-memory primitives allow user programs to load data into a cluster's memory and query it repeatedly, which makes it ideal for building scalable machine learning applications. We empirically evaluate our implementation on large-scale real-world problems in the machine learning canonical tasks of classification and regression. Comparing our implementation of AdaGrad with the SGD scheduler currently available in Spark's Machine Learning Library (MLlib), we experimentally show that AdaGrad saves time by avoiding manually setting a learning-rate hyperparameter, converges fast and can even achieve better generalization errors.

Topic Models to Increase User Engagement on Twitter

Aastha Nigam Salvador Aguinaga and Nitesh V. Chawla
Poster Grace Hopper Conference (GHC) 2015, Houston - TX & Computing Research Association - Women (CRA-W) Workshop 2015, San Fransisco - CA

Abstract

Twitter is a micro-blogging site which enables individuals to write about their daily activities, express opinions, share information and connect with other users and businesses. Twitter is widely used by companies or brands to reach out to their customers, increase awareness about various topics and products. According to a Twitter report1, there are 284 million monthly active users, and 500 million tweets shared every day. This provides a unique business opportunity for companies to reach out to their customers, increase awareness and interaction among users about a topic or product and keep them engaged with their content. Using the 80/20 principle2, 80% of the Tweets should be directed towards interacting and engaging with the current followers. The relevant information cannot be captured using only the hashtags, mentions or keyword search, we need to model user’s interests based on their tweeting patterns and the content of the Tweet. The companies need to listen to the trending topics amongst their followers to connect with them. In order to discover content a user is or might be interested in, we need to build a topic profile for each user which contains high level topics which he/she discusses in the post3.