Author Index

Abelson, Brian

Targeting Direct Cash Transfers to the Extremely Poor (Page 1563)


Abeysuriya, Romesh

Sleep Analytics and Online Selective Anomaly Detection (Page 362)


Ackermann, Chris

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Acs, Gergely

A Case Study: Privacy Preserving Release of Spatio-Temporal Density in Paris (Page 1679)


Adhikari, Samrachana

Early Prediction of Code Blue Using Electronic Medical Records (Page 1917)


Agarwal, Deepak

Activity Ranking in LinkedIn Feed (Page 1603)

Budget Pacing for Targeted Online Advertisements at LinkedIn (Page 1613)


Aggarwal, Charu C.

The Setwise Stream Classification Problem (Page 432)


Aggarwal, Varun

A System to Grade Computer Programming Skills Using Machine Learning (Page 1887)


Agrawal, Rakesh

Grouping Students in Educational Settings (Page 1017)


Ahmadi, Zahra

Prototype-based Learning on Concept-drifting Data Streams (Page 412)


Ahmed, Amr

Reducing the Sampling Complexity of Topic Models (Page 891)


Ahmed, Nesreen K.

Graph Sample and Hold: A Framework for Big-Graph Analytics (Page 1446)


Akiba, Takuya

Network Structural Analysis via Core-Tree-Decomposition (Page 1476)


Akoglu, Leman

Focused Clustering and Outlier Detection in Large Attributed Graphs (Page 1346)


Al-Rfou, Rami

DeepWalk: Online Learning of Social Representations (Page 701)


Amatriain, Xavier

The Recommender Problem Revisited: Morning Tutorial (Page 1971)


Anagnostopoulos, Aris

Event Detection in Activity Networks (Page 1176)


Anagnostopoulos, Christos

Scaling Out Big Data Missing Value Imputations: Pythia vs. Godzilla (Page 651)


Andreoli, Jean-Marc

New Algorithms for Parking Demand Management and a City Scale Deployment (Page 1819)


Anis, Aamir

Active Semi-Supervised Learning Using Sampling Theory for Graph Signals (Page 492)


Ansary, Rizwan

Mining Text Snippets for Images on the Web (Page 1534)


Appleton, James

Predicting Student Risks Through Longitudinal Analysis (Page 1544)


Arredondo, Jaime

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Attenberg, Josh

Style in the Long Tail: Discovering Unique Interests with Latent Variable Models in Large Scale Social E-Commerce (Page 1640)


Avvenuti, Marco

EARS (Earthquake Alert and Report System): A Real Time Decision Support System for Earthquake Crisis Management (Page 1749)


Babaie, Tahereh

Sleep Analytics and Online Selective Anomaly Detection (Page 362)


Bachman, Benjamin J.

Automated Hypothesis Generation Based on Mining Scientific Literature (Page 1877)


Badanidiyuru, Ashwinkumar

Streaming Submodular Maximization: Massive Data Summarization on the Fly (Page 671)


Bahadori, Mohammad Taha

FBLG: A Simple and Effective Approach for Temporal Dependence Discovery from Time Series Data (Page 382)


Bailey, James

Effective Global Approaches for Mutual Information Based Feature Selection (Page 512)


Baker, Simon

Mining Text Snippets for Images on the Web (Page 1534)


Balakrishnan, Shobana

Scalable Near Real-Time Failure Localization of Data Center Networks (Page 1689)


Banerjee, Siddhartha

FAST-PPR: Scaling Personalized PageRank Estimation for Large Graphs (Page 1436)


Baraniuk, Richard G.

Time-Varying Learning and Content Analytics via Sparse Factor Analysis (Page 452)


Barbieri, Nicola

Who to Follow and Why: Link Prediction with Explanations (Page 1266)


Beckman, Richard

ISIS: A Networked-Epidemiology Based Pervasive Web App for Infectious Disease Pandemic Planning and Response (Page 1847)


Bendersky, Michael

Up Next: Retrieval Methods for Large Scale Related Video Suggestion (Page 1769)


Bengio, Yoshua

Scaling Up Deep Learning (Page 1966)


Benson, Austin R.

Learning Multifractal Structure in Large Networks (Page 1326)


Bertetto, John

Reducing Gang Violence Through Network Influence Based Targeting of Social Programs (Page 1829)


Beutel, Alex

CatchSync: Catching Synchronized Behavior in Large Directed Graphs (Page 941)


Bhagat, Smriti

On Social Event Organization (Page 1206)

Optimal Recommendations Under Attraction, Aversion, and Social Influence (Page 811)


Bhardwaj, Anurag

Large Scale Visual Recommendations from Street Fashion Images (Page 1925)


Bhasin, Anmol

Modeling Professional Similarity by Mining Professional Career Trajectories (Page 1945)


Bhattacharya, Indrajit

A Bayesian Framework for Estimating Properties of Network Diffusions (Page 1216)


Bhattacharyya, Prantik

LASTA: Large Scale Topic Assignment on Multiple Social Networks (Page 1809)


Bhowmick, Sanjukta

On the Permanence of Vertices in Network Communities (Page 1396)


Bi, Bin

Who Are Experts Specializing in Landscape Photography? Analyzing Topic-Specific Authority on Content Sharing Services (Page 1506)


Bisset, Keith R.

ISIS: A Networked-Epidemiology Based Pervasive Web App for Infectious Disease Pandemic Planning and Response (Page 1847)


Bonchi, Francesco

Core Decomposition of Uncertain Graphs (Page 1316)

Correlation Clustering: from Theory to Practice (Page 1972)

Who to Follow and Why: Link Prediction with Explanations (Page 1266)


Bordes, Antoine

Constructing and Mining Web-Scale Knowledge Graphs: KDD 2014 Tutorial (Page 1967)


Bourse, Florian

Balanced Graph Edge Partition (Page 1456)


Boutsidis, Christos

Provable Deterministic Leverage Score Sampling (Page 997)


Bradley, Paul

Industry & Government Track Welcome From Program Chairs


Brantingham, Patricia L.

Spatially Embedded Co-Offence Prediction Using Supervised Learning (Page 1789)


Brennan, Thomas

Management and Analytic of Biomedical Big Data with Cloud-Based In-Memory Database and Dynamic Querying: A Hands-on Experience with Real-world Data (Page 1970)


Brimmer, Nicole

Unfolding Physiological State: Mortality Modelling in Intensive Care Units (Page 75)


Bugdayci, Ahmet

Modeling Professional Similarity by Mining Professional Career Trajectories (Page 1945)


Buntine, Wray

Experiments with Non-Parametric Topic Models (Page 881)


Butler, Patrick

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Cadena, Jose

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Cafarella, Michael

Integrating Spreadsheet Data via Accurate and Low-Effort Extraction (Page 1126)


Cam, Hasan

Towards Scalable Critical Alert Mining (Page 1057)


Candan, K. Selçuk

LWI-SVD: Low-rank, Windowed, Incremental Singular Value Decompositions on Time-Evolving Data Sets (Page 987)


Cao, Caleb Chen

From Labor to Trader: Opinion Elicitation via Online Crowds as a Market (Page 1067)

TCS: Efficient Topic Discovery Over Crowd-Oriented Service Data (Page 861)


Cao, Lei

Detecting Moving Object Outliers in Massive-Scale Trajectory Streams (Page 422)


Castaldi, Peter J.

Dual Beta Process Priors for Latent Cluster Discovery in Chronic Obstructive Pulmonary Disease (Page 155)


Castellanos, Malu

Dynamics of News Events and Social Media Reaction (Page 901)


Castelluccia, Claude

A Case Study: Privacy Preserving Release of Spatio-Temporal Density in Paris (Page 1679)


Chakrabarti, Soumen

Open-Domain Quantity Queries on Web Tables: Annotation, Response, and Consensus Models (Page 711)


Chakraborty, Tanmoy

On the Permanence of Vertices in Network Communities (Page 1396)


Chan, Jeffrey

Effective Global Approaches for Mutual Information Based Feature Selection (Page 512)


Chang, Eric

Inferring Gas Consumption and Pollution Emission of Vehicles Throughout a City (Page 1027)


Chang, Kevin Chen-Chuan

Unifying Learning to Rank and Domain Adaptation: Enabling Cross-Task Document Scoring (Page 781)


Chang, Yi

Identifying and Labeling Search Tasks via Query-based Hawkes Processes (Page 731)


Chapelle, Olivier

Modeling Delayed Feedback in Display Advertising (Page 1097)


Charlin, Laurent

Leveraging User Libraries to Bootstrap Collaborative Filtering (Page 173)


Chau, Duen Horng

Guilt by Association: Large Scale Malware Detection by Mining File-relation Graphs (Page 1524)


Chaudhari, Sneha

FoodSIS: A Text Mining System to Improve the State of Food Safety in Singapore (Page 1709)


Chawla, Nitesh V.

Improving Management of Aquatic Invasions by Integrating Shipping Network, Ecological, and Environmental Data: Data Mining for Social Good (Page 1699)

Inferring User Demographics and Social Strategies in Mobile Social Networks (Page 15)


Chawla, Sanjay

Sleep Analytics and Online Selective Anomaly Detection (Page 362)


Chen, Bee-Chung

Activity Ranking in LinkedIn Feed (Page 1603)


Chen, Bowei

An Empirical Study of Reserve Price Optimisation in Real-Time Bidding (Page 1897)


Chen, Daizhuo

Scalable Hands-Free Transfer Learning for Online Advertising (Page 1573)


Chen, Enhong

GeoMF: Joint Geographical Modeling and Matrix Factorization for Point-of-Interest Recommendation (Page 831)

Mobile App Recommendations with Security and Privacy Awareness (Page 951)


Chen, Feng

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)

Modeling Mass Protest Adoption in Social Network Communities Using Geometric Brownian Motion (Page 1660)

Non-Parametric Scan Statistics for Event Detection and Forecasting in Heterogeneous Social Media Graphs (Page 1166)


Chen, Jiangzhuo

ISIS: A Networked-Epidemiology Based Pervasive Web App for Infectious Disease Pandemic Planning and Response (Page 1847)


Chen, Lei

From Labor to Trader: Opinion Elicitation via Online Crowds as a Market (Page 1067)

TCS: Efficient Topic Discovery Over Crowd-Oriented Service Data (Page 861)


Chen, Rui

Differentially Private Network Data Release via Structural Inference (Page 911)


Chen, Ting

Topic-Factorized Ideal Point Estimation Model for Legislative Voting Network (Page 183)


Chen, Wei

Identifying Tourists from Public Transport Commuters (Page 1779)

Minimizing Seed Set Selection with Probabilistic Coverage Guarantee in a Social Network (Page 1306)


Chen, Wenlin

Fast Flux Discriminant for Large-Scale Sparse Nonlinear Classification (Page 621)


Chen, Xilun

LWI-SVD: Low-rank, Windowed, Incremental Singular Value Decompositions on Time-Evolving Data Sets (Page 987)


Chen, Ying

Automated Hypothesis Generation Based on Mining Scientific Literature (Page 1877)


Chen, Yixin

Fast Flux Discriminant for Large-Scale Sparse Nonlinear Classification (Page 621)


Chen, Yuqiang

Efficient Mini-Batch Training for Stochastic Optimization (Page 661)


Chen, Zhe

Integrating Spreadsheet Data via Accurate and Low-Effort Extraction (Page 1126)


Chen, Zhiyuan

Mining Topics in Documents: Standing on the Shoulders of Big Data (Page 1116)


Cheng, Dehua

FBLG: A Simple and Effective Approach for Temporal Dependence Discovery from Time Series Data (Page 382)

Parallel Gibbs Sampling for Hierarchical Dirichlet Processes via Gamma Processes Equivalence (Page 562)


Chenthamarakshan, Vijil

Predicting Employee Expertise for Talent Management in the Enterprise (Page 1729)


Chia, Chih-Chun

Scalable Noise Mining in Long-Term Electrocardiographic Time-Series to Predict Death Following Heart Attacks (Page 125)


Chierichetti, Flavio

Correlation Clustering in MapReduce (Page 641)


Cho, Junghoo

Who Are Experts Specializing in Landscape Photography? Analyzing Topic-Specific Authority on Content Sharing Services (Page 1506)


Cho, Michael H.

Dual Beta Process Priors for Latent Cluster Discovery in Chronic Obstructive Pulmonary Disease (Page 155)


Clifton, Chris

Top-k Frequent Itemsets via Differentially Private FP-Trees (Page 931)


Clinchant, Stéphane

New Algorithms for Parking Demand Management and a City Scale Deployment (Page 1819)


Cohen, Edith

Distance Queries from Sampled Data: Accurate and Efficient (Page 681)


Comer, Austin

Automated Hypothesis Generation Based on Mining Scientific Literature (Page 1877)


Cong, Gao

COM: A Generative Model for Group Recommendation (Page 163)


Conway, Drew

Data Science Through the Lens of Social Science (Page 1520)


Cormode, Graham

Sampling for Big Data: A Tutorial (Page 1975)


Cox, James

Safe and Efficient Screening for Sparse Support Vector Machine (Page 542)


Cresci, Stefano

EARS (Earthquake Alert and Report System): A Real Time Decision Support System for Earthquake Crisis Management (Page 1749)


Cui, Peng

CatchSync: Catching Synchronized Behavior in Large Directed Graphs (Page 941)

FEMA: Flexible Evolutionary Multi-Faceted Analysis for Dynamic Behavioral Pattern Discovery (Page 1186)


Dalessandro, Brian

Industry & Government Track Welcome From Program Chairs

Scalable Hands-Free Transfer Learning for Online Advertising (Page 1573)


Dalvi, Nilesh

Correlation Clustering in MapReduce (Page 641)


Damianou, Andreas

Active Learning for Sparse Bayesian Multilabel Classification (Page 472)


Dance, Christopher

New Algorithms for Parking Demand Management and a City Scale Deployment (Page 1819)


Danescu-Niculescu-Mizil, Cristian

People on Drugs: Credibility of User Statements in Health Communities (Page 65)


Dasu, Tamraparni

Empirical Glitch Explanations (Page 572)


Davidson, Ian

Clinical Risk Prediction with Multilinear Sparse Logistic Regression (Page 145)


Dayaram, Tajhal

Automated Hypothesis Generation Based on Mining Scientific Literature (Page 1877)


De Poalo, Tracy

Predictive Modeling in Practice (Page 1517)


de Rijke, Maarten

Personalized Search Result Diversification via Structured Learning (Page 751)


Deng, Alex

Seven Rules of Thumb for Web Site Experimenters (Page 1857)


Deng, Hongbo

Identifying and Labeling Search Tasks via Query-based Hawkes Processes (Page 731)


Der, Matthew F.

Knock It Off: Profiling the Online Storefronts of Counterfeit Merchandise (Page 1759)


Di, Wei

Large Scale Visual Recommendations from Street Fashion Images (Page 1925)


Diao, Qiming

Jointly Modeling Aspects, Ratings and Sentiments for Movie Recommendation (JMARS) (Page 193)


Dilkina, Bistra

Scalable Diffusion-Aware Optimization of Network Topology (Page 1226)


Ding, Bolin

Scalable Near Real-Time Failure Localization of Data Center Networks (Page 1689)


Ding, Rui

Correlating Events with Time Series for Incident Diagnosis (Page 1583)


Donehower, Lawrence

Automated Hypothesis Generation Based on Mining Scientific Literature (Page 1877)


Dong, Anlei

Identifying and Labeling Search Tasks via Query-based Hawkes Processes (Page 731)


Dong, Xin Luna

Knowledge Vault: A Web-Scale Approach to Probabilistic Knowledge Fusion (Page 601)


Dong, Yuxiao

Inferring User Demographics and Social Strategies in Mobile Social Networks (Page 15)


Doshi-Velez, Finale

Unfolding Physiological State: Mortality Modelling in Intensive Care Units (Page 75)


Dougherty, Edward

Modeling Mass Protest Adoption in Social Network Communities Using Geometric Brownian Motion (Page 1660)


Doyle, Andy

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Drake, John M.

Improving Management of Aquatic Invasions by Integrating Shipping Network, Ecological, and Environmental Data: Data Mining for Social Good (Page 1699)


Duan, Bing

Applying Data Mining Techniques to Address Critical Process Optimization Needs in Advanced Manufacturing (Page 1739)


Duan, Lian

Community Detection in Graphs through Correlation (Page 1376)


Duffield, Nick

Graph Sample and Hold: A Framework for Big-Graph Analytics (Page 1446)

Sampling for Big Data: A Tutorial (Page 1975)


Duggirala, Mayuri

Predicting Student Risks Through Longitudinal Analysis (Page 1544)


Dundar, Murat

Batch Discovery of Recurring Rare Classes Toward Identifying Anomalous Samples (Page 223)


Dy, Jennifer G.

Dual Beta Process Priors for Latent Cluster Discovery in Chronic Obstructive Pulmonary Disease (Page 155)


E., Shawn

Welcome from Bloomberg


Eagle, Nathan

Big Data for Social Good (Page 1522)


El-Kishky, Ahmed

Bringing Structure to Text: Mining Phrases, Entities, Topics, and Hierarchies (Page 1968)


Ellenberger, John

Management and Analytic of Biomedical Big Data with Cloud-Based In-Memory Database and Dynamic Querying: A Hands-on Experience with Real-world Data (Page 1970)


Embar, Varun R.

A Bayesian Framework for Estimating Properties of Network Diffusions (Page 1216)


Emrich, Tobias

Representative Clustering of Uncertain Data (Page 243)


Eneva, Elena

Early Prediction of Code Blue Using Electronic Medical Records (Page 1917)


Ester, Martin

Spatially Embedded Co-Offence Prediction Using Supervised Learning (Page 1789)


Etzioni, Oren

Open Question Answering Over Curated and Extracted Knowledge Bases (Page 1156)

The Battle for the Future of Data Mining (Page 1)


Evans, James A.

Active Collaborative Permutation Learning (Page 502)


Fader, Anthony

Open Question Answering Over Curated and Extracted Knowledge Bases (Page 1156)


Faloutsos, Christos

CatchSync: Catching Synchronized Behavior in Large Directed Graphs (Page 941)

Detecting Anomalies in Dynamic Rating Data: A Robust Probabilistic Model for Rating Evolution (Page 841)

FUNNEL: Automatic Mining of Spatially Coevolving Epidemics (Page 105)

Good-Enough Brain Model: Challenges, Algorithms and Discoveries in Multi-Subject Experiments (Page 95)


Fan, Wei

Class-Distribution Regularized Consensus Maximization for Alleviating Overfitting in Model Combination (Page 303)

Efficient Multi-Task Feature Learning with Calibration (Page 761)

Inside the Atoms: Ranking on a Network of Networks (Page 1356)

Supervised Deep Learning with Auxiliary Networks (Page 353)


Fancher, Scott W.

Predicting Employee Expertise for Talent Management in the Enterprise (Page 1729)


Fang, Dongping

Predicting Employee Expertise for Talent Management in the Enterprise (Page 1729)


Fang, Meng

Networked Bandits with Disjoint Linear Payoffs (Page 1106)


Fang, Xiaomin

Fast DTT: A Near Linear Algorithm for Decomposing a Tensor into Factor Tensors (Page 967)


Färber, Ines

SMVC: Semi-Supervised Multi-View Clustering in Subspace Projections (Page 253)


Fayed, Youssef

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Feng, Jing

Relevant Overlapping Subspace Clusters on Categorical Data (Page 213)


Feng, Mengling

Management and Analytic of Biomedical Big Data with Cloud-Based In-Memory Database and Dynamic Querying: A Hands-on Experience with Real-world Data (Page 1970)


Fernandez, Joseph

Unveiling Clusters of Events for Alert and Incident Management in Large-Scale Enterprise IT (Page 1630)


Fiss, Juliet

Mining Text Snippets for Images on the Web (Page 1534)


Fitter, Percy

Scalable Near Real-Time Failure Localization of Data Center Networks (Page 1689)


Ford, Jim

'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators (Page 1799)


Fradkin, Dmitriy

Log-based Predictive Maintenance (Page 1867)


Fu, Qiang

Correlating Events with Time Series for Incident Diagnosis (Page 1583)


Fu, Yanjie

Exploiting Geographic Dependencies for Real Estate Appraisal: A Mutual Perspective of Ranking and Clustering (Page 1047)


Fyshe, Alona

Good-Enough Brain Model: Challenges, Algorithms and Discoveries in Multi-Subject Experiments (Page 95)