Accepted Papers & Stats

Area Papers Accepted Posters Accepted

Information Retrieval 58 54
Knowledge Management 45 41
Databases 24 24
Industry 5 4
Overall 131 124
Acceptance 17% 16%

Overall number of submissions: 772

Winner: Best Interdisciplinary Paper Award

"Learning to Link with Wikipedia"
David Milne (Department of Computer Science)
Ian H. Witten (University of Waikato)

Runner Up: Best Interdisciplinary Paper Award

"Structural Relevance: A Common Basis for the Evaluation of Structured Document Retrieval"
M. S. Ali (University of Toronto)
Mariano P. Consens (University of Toronto)
Gabriella Kazai (Microsoft Research)
Mounia Lalmas (Queen Mar y, U. of London)

Winner: Best Poster

"Trust, Authority and Popularity in Social Information Retrieval"
Gabriella Kazai (Microsoft Research)
Natasa Milic-Frayling (Microsoft Research)

Information Retrieval Papers

"Comparing Metrics across TREC and NTCIR: The Robustness to System Bias"
Tetsuya Sakai (40)

"Modeling Multi-step Relevance Propagation for Expert Finding"
Pavel Serdyukov, Henning Rode, Djoerd Hiemstra (51)

"Improved Query Difficulty Prediction for the Web"
Claudia Hauff, Vanessa Murdock, Ricardo Baeza-Yates (61)

"Achieving both High Precision and High Recall in Near-duplicate Detection"
Lianen Huang, Lei Wang, Xiaoming Li (63)

"Key Blog Distillation: Ranking Aggregates"
Craig Macdonald, Iadh Ounis (72)

"Using Structured Text for Large-Scale Attribute Extraction"
Sujith Ravi, Marius Pasca (107)

"Ranked Feature Fusion Models for Ad Hoc Retrieval"
Jeremy Pickens, Gene Golovchinsky (108)

"Comparing Citation Contexts for Information Retrieval"
Anna Ritchie, Simone Teufel, Stephen Robertson (116)

"Evaluation Methods for Information Access Tasks"
Leif Azzopardi, Vishwa Vinay (120)

"Answering Questions with Authority"
Andrew Hickl (124)

"Generalized Inverse Document Frequency"
Donald Metzler (131)

"To Swing or not to Swing: Learning when (not) to Advertise"
Andrei Broder, Massimiliano Ciaramita, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Donald Metzler, Vanessa Murdock, Vassilis Plachouras (132)

"Tapping on the Potential of Q&A Community by Recommending Answer Provider"
Jinwen Guo, Shengliang Xu, Shenghua Bao, Yong Yu (133)

"A Two-stage Text Mining Model for Information Filtering"
Yuefeng Li, Xujuan Zhou, Peter Bruza, Yue Xu, Raymond Y.K. Lau (203)

"Active Relevance Feedback for Difficult Queries"
zuobing xu, Ram Akella (204)

"A Random Walk on the Red Carpet: Rating Movies with User Reviews and PageRank"
Derry Wijaya, Stephane Bressan (227)

"An Effective Statistical Approach to Blog Post Opinion Retrieval"
Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (266)

"Revisiting the relationship between Document Length and Relevance"
David Losada, Leif Azzopardi, Mark Baillie (285)

"How does Clickthrough Data Reflect Retrieval Quality?"
Filip Radlinski, Madhu Kurup, Thorsten Joachims (332)

"Exploiting Temporal Contexts in Text Classification"
Leonardo Rocha, Fernando Mourao, Adriano Pereira, Marcos Goncalves, Wagner Meira (345)

"AdaSum: An Adaptive Model for Summarization"
Zhang Jin (368)

"A System for Finding Biological Entities that Satisfy Certain Conditions from Texts"
Wei Zhou, Clement Yu, Weiyi Meng (385)

"Are Clickthrough Data Adequate for Learning Web Search Rankings?"
Zhicheng Dou, Ruihua Song, Xiaojie Yuan, Ji-Rong Wen (396)

"Blog Site Search Using Resource Selection"
Jangwon Seo, Bruce Croft (397)

"Automatic Online News Topic Ranking Using Media Focus and User Attention Based on Aging Theory"
Canhui Wang, Min Zhang, Liyun Ru, Shaoping Ma (437)

"Can All Tags Be Used for Search?"
Kerstin Bischoff, Claudiu Firan, Wolfgang Nejdl, Raluca Paiu (504)

"Dr. Searcher and Mr. Browser: A Unified Hyperlink-Click Graph"
Barbara Poblete, Carlos Castillo, Aristides Gionis (541)

"A Densitometric Approach to Web Page Segmentation "
Christian Kohlschutter, Wolfgang Nejdl (549)

"Matching Task Profiles and User Needs in Personalized Web Search"
Julia Luxenburger, Shady Elbassuoni, Gerhard Weikum (583)

"Translation Enhancement: A New Relevance Feedback Method for Cross-Language Information Retrieval"
Daqing He, Dan Wu (585)

"High-Dimensional Descriptor Indexing for Large Multimedia Databases"
Eduardo Valle, Matthieu Cord, Sylvie Philipp-Foliguet (595)

"Social Tags: Meanings and Suggestions"
Fabian Suchanek, Milan Vojnovic, Dinan Gunawardena (617)

"A Generative Retrieval Model for Structured Documents"
Le Zhao, Jamie Callan (628)

"Cache-disk-cpu-aware Load Balancing for Question Answering"
David Dominguez-Sal, Mihai Surdeanu, Josep Aguilar-Saborit, Josep-LL. Larriba-Pey (653)

"How evaluator domain expertise affects search result relevance judgements"
Kenneth Kinney, Scott Huffman, Juting Zhai (663)

"On Low Dimensional Random Projections and Similarity Search"
Yu-En Lu, Pietro Lio, Steven Hand (681)

"Structural Relevance: A Common Basis for the Evaluation of Structured Document Retrieval"
Sadek Ali, Mariano Consens, Gabriella Kazai, Mounia Lalmas (684)

"Muti-Aspect Expertise Matching for Review Assignment"
Maryam Karimzadehgan, ChengXiang Zhai, Belford Geneva (700)

"Kernel Methods; Syntax and Semantics for Relational Text Categorization"
Moschitti Alessandro (704)

"Trada: Tree Based Ranking Function Adaptation"
Keke Chen, Rongqing Lu, CK Wong, Gordon Sun, Larry Heck, Belle Tseng (705)

"Adaptive Distributed Indexing for Structured Peer-to-Peer Networks"
Linh Nguyen, Wai Gen Yee, Frieder Ophir (720)

"Efficient and Effective Link Analysis with Precomputed SALSA Maps"
Marc Najork, Nick Craswell (794)

"Modeling Hidden Topics on Document Manifold"
Deng Cai (812)

"Beyond the Session Timeout: Automatic Hierarchical Segmentation of Search Topics in Query Logs"
Rosie Jones, Kristina Klinkner (838)

"TinyLex: Static N-Gram Index Pruning with Perfect Recall"
Derrick Coetzee (840)

"Vanity Fair: Privacy in Querylog Bundles"
Rosie Jones, Ravi Kumar, Bo Pang, Andrew Tomkins (851)

"Understanding the Relationship between Searchers' Queries and Information Goals"
Doug Downey, Dan Liebling, Susan Dumais (878)

"SoRec: Social Recommendation Using Probabilistic Matrix Factorization"
Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King (885)

"Mining Social Networks Using Heat Diffusion Processes for Marketing Candidates Selection"
Hao Ma, Haixuan Yang, Michael R. Lyu, Irwin King (894)

"Search Advertising using Web Relevance Feedback"
Andrei Broder, Peter Ciccolo, Marcus Fontoura, Evgeniy Gabrilovich, Vanja Josifovski, Lance Riedel (900)

"Learning Latent Semantic Relations from Query Logs for Query Suggestion"
Hao Ma, Haixuan Yang, Irwin King, Michael R. Lyu (912)

"Simultaneous Multilingual Search for Translingual Information Retrieval"
Kristen Parton, Kathleen McKeown, James Allan, Enrique Henestroza (919)

"Query Suggestion Using Hitting Time"
Qiaozhu Mei, Dengyong Zhou, Kenneth Church (921)

"Statistical Power in Retrieval Experimentation"
William Webber, Alistair Moffat, Justin Zobel (946)

"Joke retrieval: recognizing the same joke told differently"
Lisa Friedland, James Allan (952)

"Probabilistic Polyadic Factorization and Its Application to Personalized Recommendation"
Yun Chi, Shenghuo Zhu, Yihong Gong, Yi Zhang (983)

"Can Phrase Indexing Help to Process Non-Phrase Queries?"
Mingjie Zhu, Shuming Shi (996)

"Relating Dependent Indexes using Dempster-Shafer Theory"
Lixin Shi, Jian-Yun Nie, Guihong Cao (1094)


Knowledge Management Papers

"A Sparse Gaussian Processes Classification Framework for Fast Tag Suggestions"
Yang Song, Lu Zhang, C. Lee Giles (15)

"Local Approximation of PageRank and Reverse PageRank"
Li-Tal Mashiach, Ziv Bar-Yossef (30)

"An Algorithm to Determine Peer-Reviewers"
Marko Rodriguez, Johan Bollen (77)

"Extremely Fast Text Feature Extraction for Classification and Indexing"
George Forman, Evan Kirshenbaum (98)

"A Framework for Estimating Complex Probability Density Structures in Data Stream"
Arnold Boedihardjo, Chang-Tien Lu, Chen Feng (100)

"BNS Feature Scaling: An Improved Representation over TF-IDF for SVM Text Classification"
George Forman (105)

"Fast Mining of Complex Time-Stamped Events"
Hanghang Tong, Yasushi Sakurai, Tina Eliassi-Rad, Christos Faloutsos (109)

"Classifying Networked Entities with Modularity Kernels"
Dell Zhang, Robert Mao (146)

"Error-Driven Generalist+Experts (EDGE): A Multi-stage Ensemble Framework for Text Categorization"
Jian Huang, Omid Madani, C. Lee Giles (167)

"Intra-document Structural Frequency Features for Semi-supervised Domain Adaptation"
Andrew Arnold, William Cohen (176)

"Mining Term Association Patterns from Search Logs for Effective Query Reformulation"
Xuanhui Wang, ChengXiang Zhai (178)

"Characterizing and Predicting Community Members from Evolutionary and Heterogeneous Networks"
Sourav S Bhowmick, Qiankun Zhao, Xin Zheng, Kai Yi (184)

"Real-Time Data Pre-Processing Technique for Efficient Feature Extraction in Large Scale Datasets"
Ying Liu, Lucian V. Lita, Radu Stefan Niculescu, Kun Bai, Prasenjit Mitra, C. Lee Giles (671)

"Information Shared by Many Objects"
Chong Long, Xiaoyan Zhu, Ming Li, Bin Ma (942)

"Identification of Class Specific Discourse Patterns"
Anup Kumar Chalamalla, Sumit Negi, L. Venkata Subramaniam, Ganesh Ramakrishnan (247)

"REDUS: Finding Reducible Subspaces in High Dimensional Data"
Xiang Zhang, Feng Pan, Wei Wang (297)

"An Effective Algorithm for Mining 3-Clusters in Vertically"
Faris Alqadah, Raj Bhatnagar (308)

"Structure Feature Selection for Graph Classification"
Hongliang Fei, Jun Huan (325)

"Non-Local Evidence for Expert Finding"
Krisztian Balog, Maarten de Rijke (865)

"Academic Conference Homepage Understanding Using Constrained Hierarchical Conditional Random Fields"
Xin Xin, Juanzi Li, Jie Tang, Qiong Luo (398)

"Mining Influential Attributes That Capture Class and Group Contrast Behaviour"
Elsa Loekito, James Bailey (427)

"Transfer Learning From Multiple Source Domains via Consensus Regularization"
Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He (441)

"EDSC: efficient density-based subspace clustering"
Ira Assent, Ralph Krieger, Emmanuel Muller, Thomas Seidl (464)

"Proactive Learning: Cost-Sensitive Active Learning with Multiple Imperfect Oracles"
Pinar Donmez, Jaime Carbonell (613)

"Identifying Table Boundaries in Digital Documents via Sparse Line Detection"
Ying Liu, Prasenjit Mitra, C. Lee Giles (625)

"Finding Informative Commonalities in Concept Collections"
Simona Colucci, Eugenio Di Sciascio, Francesco Donini, Eufemia Tinelli (685)

"Inferring Semantic Query Relations from Collective User Behavior"
Nish Parikh, Neel Sundaresan (703)

"The query-flow graph: model and applications "
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, Sebastiano Vigna (701)

"Discovering Leaders from Community Actions"
Amit Goyal, Francesco Bonchi, Laks V. S. Lakshmanan (711)

"Peer Production of Structured Knowledge - an Empirical Study of Ratings and Incentive Mechanisms"
Christian Huetter, Conny Kuehne (714)

"Scalable Community Discovery on Textual Data with Relations"
Huajing Li, Zaiqing Nie, wang-chien Lee, C. Lee Giles, Ji-Rong Wen (728)

"Identification of Gene Function Using Prediction by Partial Matching (PPM) Language Models"
Malika Mahoui, W. Tehan, Arvind Kumar Thirumalaiswamy Sekhar, S. Chilukuri (735)

"Predicting Web Spam With HTTP Session Information"
Steve Webb, James Caverlee, Calton Pu (777)

"Data Weaving: Scaling Up the State of the Art in Data Clustering"
Ron Bekkerman, Martin Scholz (800)

"Clustered Subset Selection and its Applications on IT Service Metrics"
Christos Boutsidis, Jimeng Sun, Nikos Anerousis (805)

"A Consensus Based Approach to Constrained Clustering of Software Requirements"
Chuan Duan, Jane Cleland-Huang, Bamshad Mobasher (833)

"Link Privacy in Social Networks"
Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar, Ying Xu (883)

"Predicting Individual Disease Risk Based on Medical History"
Darcy Davis, Nitesh Chawla, Nicholas Christakis, Nicholas Blumm, Laszlo Barabasi (917)

"Association Thesaurus Construction Methods based on Link Co-occurrence Analysis For Wikipedia"
Masahiro Ito, Kotaro Nakayama, Takahiro Hara, Shojiro Nishio (928)

"Spam Characterization and Detection in Peer-to-Peer File-Sharing Systems"
Dongmei Jia, Wai Gen Yee, Frieder Ophir (941)

"Learning to Link with Wikipedia"
David Milne, Ian H. Witten (1046)

"Wildcards for Lightweight Information Integration in Virtual Desktops"
Rodolfo Stecher, Claudia Niederee, Wolfgang Nejdl (1060)

"Fast Correlation Analysis on Time Series Datasets"
Philon Nguyen, Nematollaah Shiri (1073)

"On Effective Presentation of Graph Patterns: A Structural Representative Approach"
Chen Chen, Xide Lin, Xifeng Yan, Jiawei Han (1084)

"Learning a two-stage SVM/CRF sequence classifier"
Guilherme Hoefel, Charles Elkan (349)


Database Papers

"Real-Time New Event Detection for Video Streams"
Gang Luo, Rong Yan, Philip Yu (64)

"Content-Based Filtering for Efficient Online Materialized View Maintenance"
Gang Luo, Philip Yu (119)

"Supporting Sub-Document Updates and Queries in an Inverted Index"
Vuk Ercegovac, Vanja Josifovski, Ning Li, Mauricio Mediano, Eugene Shekita (163)

"Efficient techniques for document sanitization"
Venkatesan Chakaravarthy, Himanshu Gupta, Prasan Roy, Mukesh Mohania (225)

"Pruning Nested XQuery Queries"
Billel Gueni, Talel Abdessalem, Bogdan Cautis, Emmanuel Waller (231)

"PROQID: Partial restarts of queries in distributed databases"
Jon Olav Hauglid, Kjetil Norvag (244)

"Exploiting Pipeline Interruptions for Efficient Memory Allocation"
Josep Aguilar Saborit, Mohammad Jalali, Dave Sharpe, Victor MuntŽs-Mulero (303)

"Modeling and Exploiting Query Interactions in Database Systems"
Mumtaz Ahmad, Ashraf Aboulnaga, Shivnath Babu, Kamesh Munagala (892)

"Anomaly-Free Incremental Output in Stream Processing"
George Mihaila, Ioana Roxana Stanoi, Christian Lang (392)

"Valid Scope Computation for Location-Dependent Spatial Query in Mobile Broadcast Environments"
Ken C. K. Lee, Josh Schiffman, Baihua Zheng, Wang-chien Lee (416)

"Records Retention in Relational Database Systems"
Ahmed Ataullah, Frank Tompa, Ashraf Aboulnaga (443)

"Linear Time Membership in a Class of Regular Expressions with Interleaving and Counting"
Giorgio Ghelli, Dario Colazzo, Carlo Sartiani (468)

"A Language for Manipulating Clustered Web Documents Results"
Gloria Bordogna, Alessandro Campi, Giuseppe Psaila, Stefania Ronchi (570)

"Rewriting of Visibly Pushdown Languages for XML Data Integration"
Alex Thomo, Venkatesh Srinivasan (693)

"Dual Encryption for Query Integrity Assurance"
Haixun Wang, Jian Yin, Chang-Shing Perng, Philip Yu (1044)

"Minimum Effort Driven Dynamic Faceted Search in Structured Databases"
Senjuti Basu Roy, Haidong Wang, Gautam Das, Ullas Nambiar, Mukesh Mohania (756)

"SNIF TOOL: Sniffing for Patterns in Continuous Streams"
Abhishek Mukherji, Elke Rundensteiner, David Brown, Venkatesh Raghavan (773)

"A Novel Optimization Approach to Efficiently Process Aggregate Similarity Queries in MAM"
Humberto Razente, Maria Camila Barioni, Agma Traina, Christos Faloutsos, Caetano Traina (776)

"A New Method for Indexing Genomes Using On-Disk Suffix Trees"
Marina Barsky, Ulrike Stege, Alex Thomo, Chris Upton (857)

"Dynamic Faceted Search for Discovery-driven Analysis"
Dash Debabrata, Rao Jun, Nimrod Megiddo, Anastasia Ailamaki, Guy Lohman (863)

"Modeling LSH for Performance Tuning"
Wei Dong, Zhe Wang, William Josephson, Moses Charikar, Kai Li (908)

"Integrating Web Query Results: Holistic Schema Matching"
Shui-Lung Chuang, Kevin Chang (936)

"A Step towards Incremental Maintenance of Composed Schema Mappings"
Qian Gang (977)

"Heuristic Approaches for Checking Containment of Generalized Tree-Pattern Queries"
Pawel Placek, Dimitri Theodoratos, Stefanos Souldatos, Theodore Dalamagas, Timos Sellis (979)


Industry Papers

"MedSearch: A Specialized Search Engine for Medical Information"
Gang Luo, Chunqiang Tang, Hao Yang, Xin Wei (65)

"Semi-automated logging of contact center telephone calls"
Roy Byrd, Mary Neff, Wilfried Teiken, Youngja Park, Keh-Shin F Cheng, Stephen Gates, Karthik Visweswariah (126)

"Some Rewrite Optimizations of XQuery Navigation in DB2"
Jarek Gryz, Guangjun Xie, Qi Cheng, Calisto Zuzarte (315)

"An Empirical Study of Required Dimensionality for Large-scale Latent Semantic Indexing Applications"
Roger Bradford (370)

"Web-Scale Named Entity Recognition"
Casey Whitelaw, Alex Kehlenbeck, Nemanja Petrovic, Lyle Ungar (531)


Gold Supporters

Silver Supporters

Bronze Supporters
Video Supporter

Book Exhibits

Local Organization