XGBoost: A Scalable Tree Boosting System |
Tianqi Chen, Carlos Guestrin |
|
|
|
code |
12882 |
node2vec: Scalable Feature Learning for Networks |
Aditya Grover, Jure Leskovec |
|
|
|
code |
4647 |
"Why Should I Trust You?": Explaining the Predictions of Any Classifier |
Marco Túlio Ribeiro, Sameer Singh, Carlos Guestrin |
|
|
|
code |
4561 |
Structural Deep Network Embedding |
Daixin Wang, Peng Cui, Wenwu Zhu |
|
|
|
code |
1374 |
Collaborative Knowledge Base Embedding for Recommender Systems |
Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, Wei-Ying Ma |
|
|
|
code |
717 |
Asymmetric Transitivity Preserving Graph Embedding |
Mingdong Ou, Peng Cui, Jian Pei, Ziwei Zhang, Wenwu Zhu |
|
|
|
code |
570 |
Interpretable Decision Sets: A Joint Framework for Description and Prediction |
Himabindu Lakkaraju, Stephen H. Bach, Jure Leskovec |
|
|
|
code |
256 |
Recurrent Marked Temporal Point Processes: Embedding Event History to Vector |
Nan Du, Hanjun Dai, Rakshit Trivedi, Utkarsh Upadhyay, Manuel Gomez-Rodriguez, Le Song |
|
|
|
code |
232 |
Convolutional Neural Networks for Steady Flow Approximation |
Xiaoxiao Guo, Wei Li, Francesco Iorio |
|
|
|
code |
222 |
Multi-layer Representation Learning for Medical Concepts |
Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Michael Thompson, James Bost, Javier Tejedor-Sojo, Jimeng Sun |
|
|
|
code |
218 |
Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features |
Ying Shan, T. Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, J. C. Mao |
|
|
|
code |
203 |
CNTK: Microsoft's Open-Source Deep-Learning Toolkit |
Frank Seide, Amit Agarwal |
|
|
|
code |
203 |
Algorithmic Bias: From Discrimination Discovery to Fairness-aware Data Mining |
Sara Hajian, Francesco Bonchi, Carlos Castillo |
|
|
|
code |
192 |
Towards Conversational Recommender Systems |
Konstantina Christakopoulou, Filip Radlinski, Katja Hofmann |
|
|
|
code |
166 |
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends |
Huayu Li, Yong Ge, Richang Hong, Hengshu Zhu |
|
|
|
code |
153 |
Deep Visual-Semantic Hashing for Cross-Modal Retrieval |
Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, Philip S. Yu |
|
|
|
code |
153 |
FRAUDAR: Bounding Graph Fraud in the Face of Camouflage |
Bryan Hooi, Hyun Ah Song, Alex Beutel, Neil Shah, Kijung Shin, Christos Faloutsos |
|
|
|
code |
147 |
Rebalancing Bike Sharing Systems: A Multi-source Data Smart Optimization |
Junming Liu, Leilei Sun, Weiwei Chen, Hui Xiong |
|
|
|
code |
125 |
FINAL: Fast Attributed Network Alignment |
Si Zhang, Hanghang Tong |
|
|
|
code |
120 |
Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications |
Himanshu Jain, Yashoteja Prabhu, Manik Varma |
|
|
|
code |
111 |
Smart Reply: Automated Response Suggestion for Email |
Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, László Lukács, Marina Ganea, Peter Young, Vivek Ramavajjala |
|
|
|
code |
111 |
Topic Modeling of Short Texts: A Pseudo-Document View |
Yuan Zuo, Junjie Wu, Hui Zhang, Hao Lin, Fei Wang, Ke Xu, Hui Xiong |
|
|
|
code |
109 |
GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media |
Chao Zhang, Keyang Zhang, Quan Yuan, Luming Zhang, Tim Hanratty, Jiawei Han |
|
|
|
code |
108 |
Meta Structure: Computing Relevance in Large Heterogeneous Information Networks |
Zhipeng Huang, Yudian Zheng, Reynold Cheng, Yizhou Sun, Nikos Mamoulis, Xiang Li |
|
|
|
code |
107 |
Latent Space Model for Road Networks to Predict Time-Varying Traffic |
Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu, Rose Yu, Yan Liu |
|
|
|
code |
105 |
Crime Rate Inference with Big Data |
Hongjian Wang, Daniel Kifer, Corina Graif, Zhenhui Li |
|
|
|
code |
96 |
User Identity Linkage by Latent User Space Modelling |
Xin Mu, Feida Zhu, Ee-Peng Lim, Jing Xiao, Jianzong Wang, Zhi-Hua Zhou |
|
|
|
code |
95 |
Predicting Disk Replacement towards Reliable Data Centers |
Mirela Madalina Botezatu, Ioana Giurgiu, Jasmina Bogojeska, Dorothea Wiesmann |
|
|
|
code |
89 |
Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs |
Emaad A. Manzoor, Sadegh M. Milajerdi, Leman Akoglu |
|
|
|
code |
75 |
Robust Influence Maximization |
Wei Chen, Tian Lin, Zihan Tan, Mingfei Zhao, Xuren Zhou |
|
|
|
code |
74 |
Extracting Optimal Performance from Dynamic Time Warping |
Abdullah Mueen, Eamonn J. Keogh |
|
|
|
code |
70 |
Unified Point-of-Interest Recommendation with Temporal Interval Assessment |
Yanchi Liu, Chuanren Liu, Bin Liu, Meng Qu, Hui Xiong |
|
|
|
code |
66 |
Anomaly Detection Using Program Control Flow Graph Mining From Execution Logs |
Animesh Nandi, Atri Mandal, Shubham Atreja, Gargi Banerjee Dasgupta, Subhrajit Bhattacharya |
|
|
|
code |
65 |
Ranking Relevance in Yahoo Search |
Dawei Yin, Yuening Hu, Jiliang Tang, Tim Daly Jr., Mianwei Zhou, Hua Ouyang, Jianhui Chen, Changsung Kang, Hongbo Deng, Chikashi Nobata, Jean-Marc Langlois, Yi Chang |
|
|
|
code |
64 |
DeepIntent: Learning Attentions for Online Advertising with Recurrent Neural Networks |
Shuangfei Zhai, Kenghao Chang, Ruofei Zhang, Zhongfei (Mark) Zhang |
|
|
|
code |
64 |
Online Context-Aware Recommendation with Time Varying Multi-Armed Bandit |
Chunqiu Zeng, Qing Wang, Shekoofeh Mokhtari, Tao Li |
|
|
|
code |
63 |
A Multi-Task Learning Formulation for Survival Analysis |
Yan Li, Jie Wang, Jieping Ye, Chandan K. Reddy |
|
|
|
code |
62 |
Dynamic Clustering of Streaming Short Documents |
Shangsong Liang, Emine Yilmaz, Evangelos Kanoulas |
|
|
|
code |
60 |
Fast Unsupervised Online Drift Detection Using Incremental Kolmogorov-Smirnov Test |
Denis Moreira dos Reis, Peter A. Flach, Stan Matwin, Gustavo E. A. P. A. Batista |
|
|
|
code |
60 |
Aircraft Trajectory Prediction Made Easy with Predictive Analytics |
Samet Ayhan, Hanan Samet |
|
|
|
code |
59 |
Partial Label Learning via Feature-Aware Disambiguation |
Min-Ling Zhang, Bin-Bin Zhou, Xu-Ying Liu |
|
|
|
code |
58 |
Compressing Graphs and Indexes with Recursive Graph Bisection |
Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Sergey Pupyrev, Alon Shalita |
|
|
|
code |
57 |
Accelerating Online CP Decompositions for Higher Order Tensors |
Shuo Zhou, Xuan Vinh Nguyen, James Bailey, Yunzhe Jia, Ian Davidson |
|
|
|
code |
56 |
Compressing Convolutional Neural Networks in the Frequency Domain |
Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger, Yixin Chen |
|
|
|
code |
56 |
Transfer Knowledge between Cities |
Ying Wei, Yu Zheng, Qiang Yang |
|
|
|
code |
55 |
Robust Extreme Multi-label Learning |
Chang Xu, Dacheng Tao, Chao Xu |
|
|
|
code |
54 |
Understanding Behaviors that Lead to Purchasing: A Case Study of Pinterest |
Caroline Lo, Dan Frankowski, Jure Leskovec |
|
|
|
code |
51 |
Joint Community and Structural Hole Spanner Detection via Harmonic Modularity |
Lifang He, Chun-Ta Lu, Jiaqi Ma, Jianping Cao, Linlin Shen, Philip S. Yu |
|
|
|
code |
51 |
Infinite Ensemble for Image Clustering |
Hongfu Liu, Ming Shao, Sheng Li, Yun Fu |
|
|
|
code |
50 |
Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding |
Xiang Ren, Wenqi He, Meng Qu, Clare R. Voss, Heng Ji, Jiawei Han |
|
|
|
code |
48 |
Robust Influence Maximization |
Xinran He, David Kempe |
|
|
|
code |
47 |
Repeat Buyer Prediction for E-Commerce |
Guimei Liu, Tam T. Nguyen, Gang Zhao, Wei Zha, Jianbo Yang, Jianneng Cao, Min Wu, Peilin Zhao, Wei Chen |
|
|
|
code |
45 |
Catch Me If You Can: Detecting Pickpocket Suspects from Large-Scale Transit Records |
Bowen Du, Chuanren Liu, Wenjun Zhou, Zhenshan Hou, Hui Xiong |
|
|
|
code |
45 |
GLMix: Generalized Linear Mixed Models For Large-Scale Response Prediction |
XianXing Zhang, Yitong Zhou, Yiming Ma, Bee-Chung Chen, Liang Zhang, Deepak Agarwal |
|
|
|
code |
45 |
Approximate Personalized PageRank on Dynamic Graphs |
Hongyang Zhang, Peter Lofgren, Ashish Goel |
|
|
|
code |
43 |
FASCINATE: Fast Cross-Layer Dependency Inference on Multi-layered Networks |
Chen Chen, Hanghang Tong, Lei Xie, Lei Ying, Qing He |
|
|
|
code |
43 |
IoT Big Data Stream Mining |
Gianmarco De Francisci Morales, Albert Bifet, Latifur Khan, João Gama, Wei Fan |
|
|
|
code |
42 |
Data-Driven Metric Development for Online Controlled Experiments: Seven Lessons Learned |
Alex Deng, Xiaolin Shi |
|
|
|
code |
41 |
Matrix Computations and Optimization in Apache Spark |
Reza Bosagh Zadeh, Xiangrui Meng, Alexander Ulanov, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan R. Sparks, Aaron Staple, Matei Zaharia |
|
|
|
code |
41 |
Towards Confidence in the Truth: A Bootstrapping based Truth Discovery Approach |
Houping Xiao, Jing Gao, Qi Li, Fenglong Ma, Lu Su, Yunlong Feng, Aidong Zhang |
|
|
|
code |
41 |
Online Optimization Methods for the Quantification Problem |
Purushottam Kar, Shuai Li, Harikrishna Narasimhan, Sanjay Chawla, Fabrizio Sebastiani |
|
|
|
code |
39 |
Learning Cumulatively to Become More Knowledgeable |
Geli Fei, Shuai Wang, Bing Liu |
|
|
|
code |
38 |
Recruitment Market Trend Analysis with Sequential Latent Variable Models |
Chen Zhu, Hengshu Zhu, Hui Xiong, Pengliang Ding, Fang Xie |
|
|
|
code |
37 |
Talent Circle Detection in Job Transition Networks |
Huang Xu, Zhiwen Yu, Jingyuan Yang, Hui Xiong, Hengshu Zhu |
|
|
|
code |
36 |
PTE: Enumerating Trillion Triangles On Distributed Systems |
Ha-Myung Park, Sung-Hyon Myaeng, U Kang |
|
|
|
code |
36 |
Contextual Intent Tracking for Personal Assistants |
Yu Sun, Nicholas Jing Yuan, Yingzi Wang, Xing Xie, Kieran McDonald, Rui Zhang |
|
|
|
code |
35 |
Modeling Precursors for Event Forecasting via Nested Multi-Instance Learning |
Yue Ning, Sathappan Muthiah, Huzefa Rangwala, Naren Ramakrishnan |
|
|
|
code |
35 |
Diversified Temporal Subgraph Pattern Mining |
Yi Yang, Da Yan, Huanhuan Wu, James Cheng, Shuigeng Zhou, John C. S. Lui |
|
|
|
code |
35 |
Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure |
Kai Ming Ting, Ye Zhu, Mark James Carman, Yue Zhu, Zhi-Hua Zhou |
|
|
|
code |
34 |
Skinny-dip: Clustering in a Sea of Noise |
Samuel Maurus, Claudia Plant |
|
|
|
code |
34 |
Structural Neighborhood Based Classification of Nodes in a Network |
Sharad Nandanwar, M. Narasimha Murty |
|
|
|
code |
34 |
City-Scale Map Creation and Updating using GPS Collections |
Chen Chen, Cewu Lu, Qixing Huang, Qiang Yang, Dimitrios Gunopulos, Leonidas J. Guibas |
|
|
|
code |
34 |
Portfolio Selections in P2P Lending: A Multi-Objective Perspective |
Hongke Zhao, Qi Liu, Guifeng Wang, Yong Ge, Enhong Chen |
|
|
|
code |
33 |
AnyDBC: An Efficient Anytime Density-based Clustering Algorithm for Very Large Complex Datasets |
Son T. Mai, Ira Assent, Martin Storgaard |
|
|
|
code |
33 |
Scalable Pattern Matching over Compressed Graphs via Dedensification |
Antonio Maccioni, Daniel J. Abadi |
|
|
|
code |
32 |
TRIÈST: Counting Local and Global Triangles in Fully-Dynamic Streams with Fixed Memory Size |
Lorenzo De Stefani, Alessandro Epasto, Matteo Riondato, Eli Upfal |
|
|
|
code |
32 |
Just One More: Modeling Binge Watching Behavior |
William Trouleau, Azin Ashkan, Weicong Ding, Brian Eriksson |
|
|
|
code |
32 |
Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations |
Wei Cheng, Kai Zhang, Haifeng Chen, Guofei Jiang, Zhengzhang Chen, Wei Wang |
|
|
|
code |
31 |
Beyond Sigmoids: The NetTide Model for Social Network Growth, and Its Applications |
Chengxi Zang, Peng Cui, Christos Faloutsos |
|
|
|
code |
31 |
An Empirical Study on Recommendation with Multiple Types of Feedback |
Liang Tang, Bo Long, Bee-Chung Chen, Deepak Agarwal |
|
|
|
code |
30 |
Taxi Driving Behavior Analysis in Latent Vehicle-to-Vehicle Networks: A Social Influence Perspective |
Tong Xu, Hengshu Zhu, Xiangyu Zhao, Qi Liu, Hao Zhong, Enhong Chen, Hui Xiong |
|
|
|
code |
30 |
Data-driven Automatic Treatment Regimen Development and Recommendation |
Leilei Sun, Chuanren Liu, Chonghui Guo, Hui Xiong, Yanming Xie |
|
|
|
code |
29 |
A Text Clustering Algorithm Using an Online Clustering Scheme for Initialization |
Jianhua Yin, Jianyong Wang |
|
|
|
code |
29 |
Probabilistic Robust Route Recovery with Spatio-Temporal Dynamics |
Hao Wu, Jiangyun Mao, Weiwei Sun, Baihua Zheng, Hanyuan Zhang, Ziyang Chen, Wei Wang |
|
|
|
code |
29 |
Hierarchical Incomplete Multi-source Feature Learning for Spatiotemporal Event Forecasting |
Liang Zhao, Jieping Ye, Feng Chen, Chang-Tien Lu, Naren Ramakrishnan |
|
|
|
code |
29 |
Bid-aware Gradient Descent for Unbiased Learning with Censored Data in Display Advertising |
Weinan Zhang, Tianxiong Zhou, Jun Wang, Jian Xu |
|
|
|
code |
28 |
Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix |
Huizhi Xie, Juliette Aurisset |
|
|
|
code |
27 |
An Engagement-Based Customer Lifetime Value System for E-commerce |
Ali Vanderveld, Addhyan Pandey, Angela Han, Rajesh Parekh |
|
|
|
code |
26 |
Predicting Matchups and Preferences in Context |
Shuo Chen, Thorsten Joachims |
|
|
|
code |
26 |
Domain Adaptation in the Absence of Source Domain Data |
Boris Chidlovskii, Stéphane Clinchant, Gabriela Csurka |
|
|
|
code |
26 |
Firebird: Predicting Fire Risk and Prioritizing Fire Inspections in Atlanta |
Michael A. Madaio, Shang-Tse Chen, Oliver L. Haimson, Wenwen Zhang, Xiang Cheng, Matthew Hinds-Aldrich, Duen Horng Chau, Bistra Dilkina |
|
|
|
code |
26 |
Developing a Data-Driven Player Ranking in Soccer Using Predictive Model Weights |
Joel Brooks, Matthew Kerr, John V. Guttag |
|
|
|
code |
25 |
Kam1n0: MapReduce-based Assembly Clone Search for Reverse Engineering |
Steven H. H. Ding, Benjamin C. M. Fung, Philippe Charland |
|
|
|
code |
25 |
A Subsequence Interleaving Model for Sequential Pattern Mining |
Jaroslav M. Fowkes, Charles Sutton |
|
|
|
code |
25 |
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with Rademacher Averages |
Matteo Riondato, Eli Upfal |
|
|
|
code |
25 |
Identifying Police Officers at Risk of Adverse Events |
Samuel Carton, Jennifer Helsby, Kenneth Joseph, Ayesha Mahmud, Youngsoo Park, Joe Walsh, Crystal Cody, C. P. T. Estella Patterson, Lauren Haynes, Rayid Ghani |
|
|
|
code |
25 |
Semi-Markov Switching Vector Autoregressive Model-Based Anomaly Detection in Aviation Systems |
Igor Melnyk, Arindam Banerjee, Bryan L. Matthews, Nikunj C. Oza |
|
|
|
code |
25 |
Unbounded Human Learning: Optimal Scheduling for Spaced Repetition |
Siddharth Reddy, Igor Labutov, Siddhartha Banerjee, Thorsten Joachims |
|
|
|
code |
25 |
Reconstructing an Epidemic Over Time |
Polina Rozenshtein, Aristides Gionis, B. Aditya Prakash, Jilles Vreeken |
|
|
|
code |
25 |
DopeLearning: A Computational Approach to Rap Lyrics Generation |
Eric Malmi, Pyry Takala, Hannu Toivonen, Tapani Raiko, Aristides Gionis |
|
|
|
code |
24 |
Streaming-LDA: A Copula-based Approach to Modeling Topic Dependencies in Document Streams |
Hesam Amoualian, Marianne Clausel, Éric Gaussier, Massih-Reza Amini |
|
|
|
code |
23 |
Singapore in Motion: Insights on Public Transport Service Level Through Farecard and Mobile Data Analytics |
Hasan Poonawala, Vinay Kolar, Sebastien Blandin, Laura Wynter, Sambit Sahu |
|
|
|
code |
23 |
A Truth Discovery Approach with Theoretical Guarantee |
Houping Xiao, Jing Gao, Zhaoran Wang, Shiyu Wang, Lu Su, Han Liu |
|
|
|
code |
23 |
Targeted Topic Modeling for Focused Analysis |
Shuai Wang, Zhiyuan Chen, Geli Fei, Bing Liu, Sherry Emery |
|
|
|
code |
22 |
Structured Doubly Stochastic Matrix for Graph Based Clustering: Structured Doubly Stochastic Matrix |
Xiaoqian Wang, Feiping Nie, Heng Huang |
|
|
|
code |
22 |
Regime Shifts in Streams: Real-time Forecasting of Co-evolving Time Sequences |
Yasuko Matsubara, Yasushi Sakurai |
|
|
|
code |
20 |
MANTRA: A Scalable Approach to Mining Temporally Anomalous Sub-trajectories |
Prithu Banerjee, Pranali Yawalkar, Sayan Ranu |
|
|
|
code |
20 |
Finding Gangs in War from Signed Networks |
Lingyang Chu, Zhefeng Wang, Jian Pei, Jiannan Wang, Zijin Zhao, Enhong Chen |
|
|
|
code |
20 |
Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks |
Jung-Woo Ha, Hyuna Pyo, Jeonghee Kim |
|
|
|
code |
19 |
Images Don't Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank |
Corey Lynch, Kamelia Aryafar, Josh Attenberg |
|
|
|
code |
19 |
Gemello: Creating a Detailed Energy Breakdown from Just the Monthly Electricity Bill |
Nipun Batra, Amarjeet Singh, Kamin Whitehouse |
|
|
|
code |
19 |
Multi-Task Feature Interaction Learning |
Kaixiang Lin, Jianpeng Xu, Inci M. Baytas, Shuiwang Ji, Jiayu Zhou |
|
|
|
code |
19 |
Boosted Decision Tree Regression Adjustment for Variance Reduction in Online Controlled Experiments |
Alexey Poyarkov, Alexey Drutsa, Andrey Khalyavin, Gleb Gusev, Pavel Serdyukov |
|
|
|
code |
19 |
Question Independent Grading using Machine Learning: The Case of Computer Program Grading |
Gursimran Singh, Shashank Srikant, Varun Aggarwal |
|
|
|
code |
19 |
When Social Influence Meets Item Inference |
Hui-Ju Hung, Hong-Han Shuai, De-Nian Yang, Liang-Hao Huang, Wang-Chien Lee, Jian Pei, Ming-Syan Chen |
|
|
|
code |
19 |
Online Asymmetric Active Learning with Imbalanced Data |
Xiaoxuan Zhang, Tianbao Yang, Padmini Srinivasan |
|
|
|
code |
19 |
Days on Market: Measuring Liquidity in Real Estate Markets |
Hengshu Zhu, Hui Xiong, Fangshuang Tang, Qi Liu, Yong Ge, Enhong Chen, Yanjie Fu |
|
|
|
code |
19 |
Mining Subgroups with Exceptional Transition Behavior |
Florian Lemmerich, Martin Becker, Philipp Singer, Denis Helic, Andreas Hotho, Markus Strohmaier |
|
|
|
code |
18 |
Parallel Dual Coordinate Descent Method for Large-scale Linear Classification in Multi-core Environments |
Wei-Lin Chiang, Mu-Chu Lee, Chih-Jen Lin |
|
|
|
code |
18 |
Scalable Betweenness Centrality Maximization via Sampling |
Ahmad Mahmoody, Charalampos E. Tsourakakis, Eli Upfal |
|
|
|
code |
18 |
Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns |
Roel Bertens, Jilles Vreeken, Arno Siebes |
|
|
|
code |
18 |
Evaluating Mobile Apps with A/B and Quasi A/B Tests |
Ya Xu, Nanyu Chen |
|
|
|
code |
17 |
Email Volume Optimization at LinkedIn |
Rupesh Gupta, Guanfeng Liang, Hsiao-Ping Tseng, Ravi Kiran Holur Vijay, Xiaoyu Chen, Rómer Rosales |
|
|
|
code |
17 |
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecasting System |
Sathappan Muthiah, Patrick Butler, Rupinder Paul Khandpur, Parang Saraf, Nathan Self, Alla Rozovskaya, Liang Zhao, Jose Cadena, Chang-Tien Lu, Anil Vullikanti, Achla Marathe, Kristen Maria Summers, Graham Katz, Andy Doyle, Jaime Arredondo, Dipak K. Gupta, David Mares, Naren Ramakrishnan |
|
|
|
code |
17 |
How to Get Them a Dream Job?: Entity-Aware Features for Personalized Job Search Ranking |
Jia Li, Dhruv Arya, Viet Ha-Thuc, Shakti Sinha |
|
|
|
code |
16 |
FUSE: Full Spectral Clustering |
Wei Ye, Sebastian Goebl, Claudia Plant, Christian Böhm |
|
|
|
code |
16 |
Positive-Unlabeled Learning in Streaming Networks |
Shiyu Chang, Yang Zhang, Jiliang Tang, Dawei Yin, Yi Chang, Mark A. Hasegawa-Johnson, Thomas S. Huang |
|
|
|
code |
16 |
Goal-Directed Inductive Matrix Completion |
Si Si, Kai-Yang Chiang, Cho-Jui Hsieh, Nikhil Rao, Inderjit S. Dhillon |
|
|
|
code |
16 |
From Truth Discovery to Trustworthy Opinion Discovery: An Uncertainty-Aware Quantitative Modeling Approach |
Mengting Wan, Xiangyu Chen, Lance M. Kaplan, Jiawei Han, Jing Gao, Bo Zhao |
|
|
|
code |
16 |
Revisiting Random Binning Features: Fast Convergence and Strong Parallelizability |
Lingfei Wu, Ian En-Hsu Yen, Jie Chen, Rui Yan |
|
|
|
code |
15 |
Deploying Analytics with the Portable Format for Analytics (PFA) |
Jim Pivarski, Collin Bennett, Robert L. Grossman |
|
|
|
code |
15 |
Efficient Processing of Network Proximity Queries via Chebyshev Acceleration |
Mustafa Coskun, Ananth Grama, Mehmet Koyutürk |
|
|
|
code |
15 |
CaSMoS: A Framework for Learning Candidate Selection Models over Structured Queries and Documents |
Fedor Borisyuk, Krishnaram Kenthapadi, David Stein, Bo Zhao |
|
|
|
code |
14 |
CompanyDepot: Employer Name Normalization in the Online Recruitment Industry |
Qiaoling Liu, Faizan Javed, Matt McNair |
|
|
|
code |
14 |
Towards Optimal Cardinality Estimation of Unions and Intersections with Sketches |
Daniel Ting |
|
|
|
code |
14 |
The Legislative Influence Detector: Finding Text Reuse in State Legislation |
Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Joe Walsh, Derek Willis, Lauren Haynes, Rayid Ghani |
|
|
|
code |
14 |
Smart Broadcasting: Do You Want to be Seen? |
Mohammad Reza Karimi, Erfan Tavakoli, Mehrdad Farajtabar, Le Song, Manuel Gomez-Rodriguez |
|
|
|
code |
14 |
Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining |
Kazuya Nakagawa, Shinya Suzumura, Masayuki Karasuyama, Koji Tsuda, Ichiro Takeuchi |
|
|
|
code |
14 |
From Online Behaviors to Offline Retailing |
Ping Luo, Su Yan, Zhiqiang Liu, Zhiyong Shen, Shengwen Yang, Qing He |
|
|
|
code |
13 |
Dynamic and Robust Wildfire Risk Prediction System: An Unsupervised Approach |
Mahsa Salehi, Laura Irina Rusu, Timothy M. Lynar, Anna Phan |
|
|
|
code |
13 |
Come-and-Go Patterns of Group Evolution: A Dynamic Model |
Tianyang Zhang, Peng Cui, Christos Faloutsos, Yunfei Lu, Hao Ye, Wenwu Zhu, Shiqiang Yang |
|
|
|
code |
13 |
Robust Large-Scale Machine Learning in the Cloud |
Steffen Rendle, Dennis Fetterly, Eugene J. Shekita, Bor-Yiing Su |
|
|
|
code |
12 |
Audience Expansion for Online Social Network Advertising |
Haishan Liu, David Pardoe, Kun Liu, Manoj Thakur, Frank Cao, Chongzhe Li |
|
|
|
code |
12 |
CatchTartan: Representing and Summarizing Dynamic Multicontextual Behaviors |
Meng Jiang, Christos Faloutsos, Jiawei Han |
|
|
|
code |
12 |
NetCycle: Collective Evolution Inference in Heterogeneous Information Networks |
Yizhou Zhang, Yun Xiong, Xiangnan Kong, Yangyong Zhu |
|
|
|
code |
12 |
Predicting Socio-Economic Indicators using News Events |
Sunandan Chakraborty, Ashwin Venkataraman, Srikanth Jagabathula, Lakshminarayanan Subramanian |
|
|
|
code |
12 |
Scalable Partial Least Squares Regression on Grammar-Compressed Data Matrices |
Yasuo Tabei, Hiroto Saigo, Yoshihiro Yamanishi, Simon J. Puglisi |
|
|
|
code |
12 |
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain Recommendation |
Beidou Wang, Martin Ester, Yikang Liao, Jiajun Bu, Yu Zhu, Ziyu Guan, Deng Cai |
|
|
|
code |
11 |
Ranking Universities Based on Career Outcomes of Graduates |
Navneet Kapur, Nikita I. Lytkin, Bee-Chung Chen, Deepak Agarwal, Igor Perisic |
|
|
|
code |
11 |
QUINT: On Query-Specific Optimal Networks |
Liangyue Li, Yuan Yao, Jie Tang, Wei Fan, Hanghang Tong |
|
|
|
code |
11 |
A Multiple Test Correction for Streams and Cascades of Statistical Hypothesis Tests |
Geoffrey I. Webb, François Petitjean |
|
|
|
code |
11 |
Computational Social Science: Exciting Progress and Future Challenges |
Duncan Watts |
|
|
|
code |
11 |
Efficient Shift-Invariant Dictionary Learning |
Guoqing Zheng, Yiming Yang, Jaime G. Carbonell |
|
|
|
code |
11 |
Convex Optimization for Linear Query Processing under Approximate Differential Privacy |
Ganzhao Yuan, Yin Yang, Zhenjie Zhang, Zhifeng Hao |
|
|
|
code |
10 |
Analyzing Volleyball Match Data from the 2014 World Championships Using Machine Learning Techniques |
Jan Van Haaren, Horesh Ben Shitrit, Jesse Davis, Pascal Fua |
|
|
|
code |
10 |
Scalable Fast Rank-1 Dictionary Learning for fMRI Big Data Analysis |
Xiang Li, Milad Makkie, Binbin Lin, Mojtaba Sedigh Fazli, Ian Davidson, Jieping Ye, Tianming Liu, Shannon Quinn |
|
|
|
code |
10 |
The Limits of Popularity-Based Recommendations, and the Role of Social Ties |
Marco Bressan, Stefano Leucci, Alessandro Panconesi, Prabhakar Raghavan, Erisa Terolli |
|
|
|
code |
10 |
Accelerated Stochastic Block Coordinate Descent with Optimal Sampling |
Aston Zhang, Quanquan Gu |
|
|
|
code |
10 |
Efficient Frequent Directions Algorithm for Sparse Matrices |
Mina Ghashami, Edo Liberty, Jeff M. Phillips |
|
|
|
code |
10 |
Mining Reliable Information from Passively and Actively Crowdsourced Data |
Jing Gao, Qi Li, Bo Zhao, Wei Fan, Jiawei Han |
|
|
|
code |
10 |
Detecting Devastating Diseases in Search Logs |
John Paparrizos, Ryen W. White, Eric Horvitz |
|
|
|
code |
9 |
Engagement Capacity and Engaging Team Formation for Reach Maximization of Online Social Media Platforms |
Alexander G. Nikolaev, Shounak Gore, Venu Govindaraju |
|
|
|
code |
9 |
Joint Optimization of Multiple Performance Metrics in Online Video Advertising |
Sahin Cem Geyik, Sergey Faleev, Jianqiang Shen, Sean O'Donnell, Santanu Kolay |
|
|
|
code |
9 |
MAP: Frequency-Based Maximization of Airline Profits based on an Ensemble Forecasting Approach |
Bo An, Haipeng Chen, Noseong Park, V. S. Subrahmanian |
|
|
|
code |
9 |
EMBERS AutoGSR: Automated Coding of Civil Unrest Events |
Parang Saraf, Naren Ramakrishnan |
|
|
|
code |
9 |
FLASH: Fast Bayesian Optimization for Data Analytic Pipelines |
Yuyu Zhang, Mohammad Taha Bahadori, Hang Su, Jimeng Sun |
|
|
|
code |
9 |
How to Compete Online for News Audience: Modeling Words that Attract Clicks |
Joon Hee Kim, Amin Mantrach, Alejandro Jaimes, Alice Oh |
|
|
|
code |
8 |
Distributing the Stochastic Gradient Sampler for Large-Scale LDA |
Yuan Yang, Jianfei Chen, Jun Zhu |
|
|
|
code |
7 |
Text Mining in Clinical Domain: Dealing with Noise |
Hoang Nguyen, Jon Patrick |
|
|
|
code |
7 |
Subjectively Interesting Component Analysis: Data Projections that Contrast with Prior Expectations |
Bo Kang, Jefrey Lijffijt, Raúl Santos-Rodriguez, Tijl De Bie |
|
|
|
code |
7 |
Accelerating the Race to Autonomous Cars |
Danny Shapiro |
|
|
|
code |
7 |
Computational Drug Repositioning Using Continuous Self-Controlled Case Series |
Zhaobin Kuang, James A. Thomson, Michael Caldwell, Peggy L. Peissig, Ron M. Stewart, David Page |
|
|
|
code |
7 |
Assessing Human Error Against a Benchmark of Perfection |
Ashton Anderson, Jon M. Kleinberg, Sendhil Mullainathan |
|
|
|
code |
7 |
Inferring Network Effects from Observational Data |
David T. Arbour, Dan Garant, David D. Jensen |
|
|
|
code |
7 |
Robust and Effective Metric Learning Using Capped Trace Norm: Metric Learning via Capped Trace Norm |
Zhouyuan Huo, Feiping Nie, Heng Huang |
|
|
|
code |
7 |
When Recommendation Goes Wrong: Anomalous Link Discovery in Recommendation Networks |
Bryan Perozzi, Michael Schueppert, Jack Saalweachter, Mayur Thakur |
|
|
|
code |
6 |
Collaborative Multi-View Denoising |
Lei Zhang, Shupeng Wang, Xiaoyu Zhang, Yong Wang, Binbin Li, Dinggang Shen, Shuiwang Ji |
|
|
|
code |
6 |
Online Feature Selection: A Limited-Memory Substitution Algorithm and Its Asynchronous Parallel Variation |
Haichuan Yang, Ryohei Fujimaki, Yukitaka Kusumura, Ji Liu |
|
|
|
code |
6 |
Lightweight Monitoring of Distributed Streams |
Arnon Lazerson, Daniel Keren, Assaf Schuster |
|
|
|
code |
6 |
Bayesian Inference of Arrival Rate and Substitution Behavior from Sales Transaction Data with Stockouts |
Benjamin Letham, Lydia M. Letham, Cynthia Rudin |
|
|
|
code |
6 |
Temporal Order-based First-Take-All Hashing for Fast Attention-Deficit-Hyperactive-Disorder Detection |
Hao Hu, Joey Velez-Ginorio, Guo-Jun Qi |
|
|
|
code |
6 |
Burstiness Scale: A Parsimonious Model for Characterizing Random Series of Events |
Rodrigo Augusto da Silva Alves, Renato Martins Assunção, Pedro Olmo Stancioli Vaz de Melo |
|
|
|
code |
6 |
Squish: Near-Optimal Compression for Archival of Relational Datasets |
Yihan Gao, Aditya G. Parameswaran |
|
|
|
code |
6 |
A Real Linear and Parallel Multiple Longest Common Subsequences (MLCS) Algorithm |
Yanni Li, Hui Li, Tihua Duan, Sheng Wang, Zhi Wang, Yang Cheng |
|
|
|
code |
6 |
Lossless Separation of Web Pages into Layout Code and Data |
Adi Omari, Benny Kimelfeld, Eran Yahav, Sharon Shoham |
|
|
|
code |
6 |
Dynamics of Large Multi-View Social Networks: Synergy, Cannibalization and Cross-View Interplay |
Yu Shi, Myunghwan Kim, Shaunak Chatterjee, Mitul Tiwari, Souvik Ghosh, Rómer Rosales |
|
|
|
code |
6 |
Compute Job Memory Recommender System Using Machine Learning |
Taraneh Taghavi, Maria Lupetini, Yaron Kretchmer |
|
|
|
code |
5 |
Minimizing Legal Exposure of High-Tech Companies through Collaborative Filtering Methods |
Bo Jin, Chao Che, Kuifei Yu, Yue Qu, Li Guo, Cuili Yao, Ruiyun Yu, Qiang Zhang |
|
|
|
code |
5 |
Lexis: An Optimization Framework for Discovering the Hierarchical Structure of Sequential Data |
Payam Siyari, Bistra Dilkina, Constantine Dovrolis |
|
|
|
code |
5 |
Predictors without Borders: Behavioral Modeling of Product Adoption in Three Developing Countries |
Muhammad Raza Khan, Joshua E. Blumenstock |
|
|
|
code |
5 |
Privacy-preserving Class Ratio Estimation |
Arun Shankar Iyer, J. Saketha Nath, Sunita Sarawagi |
|
|
|
code |
5 |
Sampling of Attributed Networks from Hierarchical Generative Models |
Pablo Robles-Granda, Sebastián Moreno, Jennifer Neville |
|
|
|
code |
5 |
Identifying Decision Makers from Professional Social Networks |
Shipeng Yu, Evangelia Christakopoulou, Abhishek Gupta |
|
|
|
code |
5 |
A Non-parametric Approach to Detect Epileptogenic Lesions using Restricted Boltzmann Machines |
Yijun Zhao, Bilal Ahmed, Thomas Thesen, Karen E. Blackmon, Jennifer G. Dy, Carla E. Brodley, Ruben Kuzniecky, Orrin Devinsky |
|
|
|
code |
5 |
Communication Efficient Distributed Kernel Principal Component Analysis |
Maria-Florina Balcan, Yingyu Liang, Le Song, David P. Woodruff, Bo Xie |
|
|
|
code |
5 |
From Prediction to Action: A Closed-Loop Approach for Data-Guided Network Resource Allocation |
Yanan Bao, Huasen Wu, Xin Liu |
|
|
|
code |
5 |
Lifelong Machine Learning and Computer Reading the Web |
Zhiyuan Chen, Estevam R. Hruschka Jr., Bing Liu |
|
|
|
code |
4 |
Continuous Experience-aware Language Model |
Subhabrata Mukherjee, Stephan Günnemann, Gerhard Weikum |
|
|
|
code |
4 |
Graph Wavelets via Sparse Cuts |
Arlei Silva, Xuan-Hong Dang, Prithwish Basu, Ambuj K. Singh, Ananthram Swami |
|
|
|
code |
4 |
Towards Robust and Versatile Causal Discovery for Business Applications |
Giorgos Borboudakis, Ioannis Tsamardinos |
|
|
|
code |
4 |
Optimal Reserve Prices in Upstream Auctions: Empirical Application on Online Video Advertising |
Miguel Angel Alcobendas Lisbona, Sheide Chammas, Kuangchih Lee |
|
|
|
code |
3 |
Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data |
Lei Han, Yu Zhang, Xiu-Feng Wan, Tong Zhang |
|
|
|
code |
3 |
Predict Risk of Relapse for Patients with Multiple Stages of Treatment of Depression |
Zhi Nie, Pinghua Gong, Jieping Ye |
|
|
|
code |
3 |
Absolute Fused Lasso and Its Application to Genome-Wide Association Studies |
Tao Yang, Jun Liu, Pinghua Gong, Ruiwen Zhang, Xiaotong Shen, Jieping Ye |
|
|
|
code |
3 |
Designing Policy Recommendations to Reduce Home Abandonment in Mexico |
Klaus Ackermann, Eduardo Blancas Reyes, Sue He, Thomas Anderson Keller, Paul van der Boor, Romana Khan, Rayid Ghani, José Carlos González |
|
|
|
code |
2 |
Online Dual Decomposition for Performance and Delivery-Based Distributed Ad Allocation |
Jim C. Huang, Rodolphe Jenatton, Cédric Archambeau |
|
|
|
code |
2 |
The Wisdom of Crowds: Best Practices for Data Prep & Machine Learning Derived from Millions of Data Science Workflows |
Ingo Mierswa |
|
|
|
code |
2 |
People, Computers, and The Hot Mess of Real Data |
Joseph M. Hellerstein |
|
|
|
code |
2 |
Learning Sparse Models at Scale |
Ralf Herbrich |
|
|
|
code |
2 |
Compact and Scalable Graph Neighborhood Sketching |
Takuya Akiba, Yosuke Yano |
|
|
|
code |
2 |
Annealed Sparsity via Adaptive and Dynamic Shrinking |
Kai Zhang, Shandian Zhe, Chaoran Cheng, Zhi Wei, Zhengzhang Chen, Haifeng Chen, Guofei Jiang, Yuan Qi, Jieping Ye |
|
|
|
code |
2 |
Causal Clustering for 1-Factor Measurement Models |
Erich Kummerfeld, Joseph D. Ramsey |
|
|
|
code |
2 |
Parallel Lasso Screening for Big Data Optimization |
Qingyang Li, Shuang Qiu, Shuiwang Ji, Paul M. Thompson, Jieping Ye, Jie Wang |
|
|
|
code |
2 |
Leveraging Propagation for Data Mining: Models, Algorithms and Applications |
B. Aditya Prakash, Naren Ramakrishnan |
|
|
|
code |
2 |
Healthcare Data Mining with Matrix Models |
Fei Wang, Ping Zhang, Joel Dudley |
|
|
|
code |
2 |
Graphons and Machine Learning: Modeling and Estimation of Sparse Massive Networks |
Jennifer T. Chayes |
|
|
|
code |
1 |
Fast Component Pursuit for Large-Scale Inverse Covariance Estimation |
Lei Han, Yu Zhang, Tong Zhang |
|
|
|
code |
1 |
Learning to Learn and Compositionality with Deep Recurrent Neural Networks: Learning to Learn and Compositionality |
Nando de Freitas |
|
|
|
code |
1 |
The Evolving Meaning of Information Security |
Whitfield Diffie |
|
|
|
code |
1 |
Identifying Earmarks in Congressional Bills |
Ellery Wulczyn, Madian Khabsa, Vrushank Vora, Matthew Heston, Joe Walsh, Christopher Berry, Rayid Ghani |
|
|
|
code |
1 |
Batch Model for Batched Timestamps Data Analysis with Application to the SSA Disability Program |
Qingqi Yue, Ao Yuan, Xuan Che, Minh Huynh, Chunxiao Zhou |
|
|
|
code |
1 |
Bayesian Optimization and Embedded Learning Systems |
Jeff Schneider |
|
|
|
code |
1 |
Collective Sensemaking via Social Sensors: Extracting, Profiling, Analyzing, and Predicting Real-world Events |
Yuheng Hu, Yu-Ru Lin, Jiebo Luo |
|
|
|
code |
1 |
Scalable Learning of Graphical Models |
François Petitjean, Geoffrey I. Webb |
|
|
|
code |
1 |
Business Applications of Predictive Modeling at Scale |
Qiang Zhu, Songtao Guo, Paul Ogilvie, Yan Liu |
|
|
|
code |
1 |
Profiling Users from Online Social Behaviors with Applications for Tencent Social Ads |
Ching Law |
|
|
|
code |
0 |
Large-Scale Machine Learning at Verizon: Theory and Applications |
Ashok Srivastava |
|
|
|
code |
0 |
How Machine Learning has Finally Solved Wanamaker's Dilemma |
Oliver Downs |
|
|
|
code |
0 |
Streaming Analytics |
Ashish Gupta, Neera Agarwal |
|
|
|
code |
0 |
A VC View of Investing in ML |
Greg Papadopoulos |
|
|
|
code |
0 |
Big Data Needs Big Dreamers: Lessons from Successful Big Data Investors |
Evangelos Simoudis, Mark Gorenberg, Tim Guleri, Matt Ocko, Greg Sands |
|
|
|
code |
0 |
Can You Teach the Elephant to Dance? AKA: Culture Eats Data Science for Breakfast |
Jonathan D. Becher |
|
|
|
code |
0 |
Scalable Time-Decaying Adaptive Prediction Algorithm |
Yinyan Tan, Zhe Fan, Guilin Li, Fangshan Wang, Zhengbing Li, Shikai Liu, Qiuling Pan, Eric P. Xing, Qirong Ho |
|
|
|
code |
0 |
Optimally Discriminative Choice Sets in Discrete Choice Models: Application to Data-Driven Test Design |
Igor Labutov, Frans Schalekamp, Kelvin Luu, Hod Lipson, Christoph Studer |
|
|
|
code |
0 |
Improving Survey Aggregation with Sparsely Represented Signals |
Tianlin Shi, Forest Agostinelli, Matthew Staib, David P. Wipf, Thomas Moscibroda |
|
|
|
code |
0 |
Scalable Data Analytics Using R: Single Machines to Hadoop Spark Clusters |
John-Mark Agosta, Debraj GuhaThakurta, Robert Horton, Mario Inchiosa, Srini Kumar, Mengyue Zhao |
|
|
|
code |
0 |