INDIAN INSTITUTE OF SCIENCE EDUCATION AND RESEARCH KOLKATA

... towards excellence in science

An Autonomous Institution, Under the Ministry of Education, Government of India

Dwaipayan Roy

Assistant Professor
Dept: Departement of Computational and Data Sciences (CDS)
E-mail: dwaipayan.roy [at] iiserkol.ac.in

All Publications:

  1. Roy, Dwaipayan; Mitra, Mandar and Ganguly, Debasis. 2018."To Clean or not to Clean: Document Pre-processing and Reproducibility." ACM Journal of Data and Information Quality, 10, 18
  2. Roy, Dwaipayan; Ganguly, Debasis; Mitra, Mandar and Jones, Gareth J.F.. 2018."Estimating Gaussian Mixture Models in the Local Neighbourhood of Embedded Word Vectors for Query Performance Prediction." Elsevier journal - Information Processing and Management, 56, 1026-1045


  1. Carevic, Zeljko; Roy, Dwaipayan and Mayr, Philipp. 2020." Characteristics of Dataset Retrieval Sessions: Experiences from a Real-life Digital Library", "24th International Conference on Theory and Practice of Digital Libraries". Springer,
  2. Datta, Suchana; Ganguly, Debasis; Roy, Dwaipayan; Bonin, Francesca; Jochim, Charles; Mitra, Mandar and Mitra, Mandar. 2020." Retrieving Potential Causes from a Query Event", "43rd International ACM SIGIR Conference on Research and Development in Information Retrieval ". ACM,
  3. Roy, Dwaipayan; Bhatia, Sumit and Jain, Prateek. 2020." A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages", "Proceedings of The 12th Language Resources and Evaluation Conference". European Language Resources Association,
  4. Biswas, Chandan; Ganguly, Debasis; Roy, Dwaipayan and Bhattacharya, Ujjwal. 2019." Privacy Preserving Approximate K-means Clustering", "28th ACM International Conference on Information and Knowledge Management". ACM,
  5. Roy, Dwaipayan; Saha, Sourav; Mitra, Mandar; Sen, Bihan and Ganguly, Debasis. 2019." I-REX: A Lucene Plugin for EXplainable IR", "28th ACM International Conference on Information and Knowledge Management". ACM,
  6. Roy, Dwaipayan; Bhatia, Sumit and Mitra, Mandar. 2019." Selecting Discriminative Terms for Relevance Model", "42nd International ACM SIGIR Conference on Research and Development in Information Retrieval". ACM,
  7. Ganguly, Debasis; Afli, Haithem and Roy, Dwaipayan. 2018." Word Embedding based Semantic Cross-Lingual Document Alignment in Comparable Corpora.", "0th Annual Meeting of the Forum for Information Retrieval Evaluation". ACM,
  8. Roy, Dwaipayan; Ganguly, Debasis; Bhatia, Sumit; Behathur, Srikanta and Mitra, Mandar. 2018." Using Word Embeddings for Information Retrieval: How Collection and Term Normalization Choices Affect Performance", "27th ACM International Conference on Information and Knowledge Management". ACM,
  9. Roy, Dwaipayan. 2017." An Improved Test Collection and Baselines for Bibliographic Citation Recommendation", "26th ACM International Conference on Information and Knowledge Management". ACM,
  10. Roy, Dwaipayan. 2017." Word Embedding based Approaches for Information Retrieval", "Seventh BCS-IRSG Symposium on Future Directions in Information Access". ACM,
  11. Roy, Dwaipayan; Ganguly, Debasis; Mitra, Mandar and Jones, Gareth J.F.. 2016." Word Embedding based Relevance Feedback using Kernel Density Estimation", "25th ACM International Conference on Information and Knowledge Management". ACM,
  12. Roy, Dwaipayan; Ray, Kunal and Mitra, Mandar. 2016." From a Scholarly Big Dataset to a Test Collection for Bibliographic Citation Recommendation", "Scholarly Big Data: AI Perspectives, Challenges, and Ideas - AAAI’16". AAAI,
  13. Ganguly, Debasis; Roy, Dwaipayan; Mitra, Mandar and Jones, Gareth J.F.. 2015." Word Embedding based Generalized Language Model for Information Retrieval", "38th International ACM SIGIR Conference on Research and Development in Information Retrieval". ACM,