-
Context-Based URL Classification for Open Access Datasets and Software in Scholarly Document. 2025 Accepted
Salsabil. L., Obadage, R. R., Banerjee, B., Abeysinghe Y., Alam, S., Färber, M., Ingram, W. A., Fox, E.A. & Wu, J. (Accepted). Context-Based URL Classification for Open Access Datasets and Software in Scholarly Document. In 2025, The 25th ACM/IEEE Join Conference on Digital Libraries (JCDL).
-
Learning from LLM Disagreement in Retrieval Evaluation. 2025 Accepted
Ingram, W. A, Banerjee, B. & Fox, E.A. (Accepted). Learning from LLM Disagreement in Retrieval Evaluation. In 2025, The 25th ACM/IEEE Join Conference on Digital Libraries (JCDL).
-
Retrieval-Augmented LLMs for ETD Subject Classification. 2025 PDF
Kalir, H., German, F., Aboelnaga, Amr A., Banerjee, B., Eldardiry, H. & Ingram W. A. Retrieval-Augmented LLMs for ETD Subject Classification. In 2025 IEEE International Conference on Big Data (BigData).
-
Using an Ensemble Approach for Layout Detection and Extraction from Historical Newspapers. 2025 PDF
Jadhav, A., Banerjee, B., & Goyne J. Using an Ensemble Approach for Layout Detection and Extraction from Historical Newspapers. In 2025 IEEE International Conference on Big Data (BigData).
-
Optical Character Recognition for Pre-Digital Historical Documents using Large Language Models. 2025 Accepted and Presented
Miller, C., & Banerjee, B. Optical Character Recognition for Pre-Digital Historical Documents using Large Language Models.
In 2025, The 29th International Conference on Machine Learning and Applications. https://www.icmla-conference.org/icmla25/acceptedpapers.html
-
When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search. 2025 PDF
Ingram, W. A., Banerjee, B., & Fox, E. A. (2025). When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search. LLM4Eval at SIGIR 2025 DOI: https://doi.org/10.48550/arXiv.2507.02139
-
Evaluating Human-LLM Alignment in ETD Subject Classification. 2025 PDF
Klair, H., German, F., Banerjee, B., & Ingram, W. A. (Accepted).Evaluating Human-LLM Alignment in ETD Subject Classification. In 2025, The 29th International Conference on Theory and Practice of Digital Libraries. https://doi.org/10.1007/978-3-032-06136-2_6
-
Making History Readable. 2024 PDF
Banerjee, B., Goyne, J., & Ingram, W. A. (2024, December). Making History Readable. In 2024 IEEE International Conference on Big Data (BigData) (pp. 8620-8622). IEEE. DOI: 10.1109/BigData62323.2024.10826028
-
Automating Chapter-Level Classification for Electronic Theses and Dissertations. 2024 PDF
Banerjee, B., Ingram, W. A., & Fox, E. A. (2024, December). Automating Chapter-Level Classification for Electronic Theses and Dissertations. In 2024 IEEE International Conference on Big Data (BigData) (pp. 2400-2409). IEEE. DOI: 10.1109/BigData62323.2024.10825418
-
Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals. 2024 PDF
Ingram, W. A., Banerjee, B., & Fox, E. A. (2024, December). Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals. In 2024 IEEE International Conference on Big Data (BigData) (pp. 8677-8679). IEEE. DOI: 10.1109/BigData62323.2024.10825072
-
Integrated Digital Library System for Long Documents
and their Elements. 2023 PDF
S. Chekuri, P. Chandrasekar, B. Banerjee, S. Park, N. Masrourisaadat, A. Ahuja, W. A. Ingram, J. Wu and E. A. Fox, "An Integrated Digital Library System for Long Documents
and their Elements," 2023 ACM/IEEE Joint Conference on Digital libraries".
-
Applications of data analysis on scholarly long documents. 2022 PDF
B. Banerjee, W. A. Ingram, J. Wu and E. A. Fox, "Applications of data analysis on scholarly long documents," 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 2473-2481, doi: 10.1109/BigData55660.2022.10020935.
-
Opening scholarly documents through text analytics. 2022PDF
Bipasha Banerjee. 2022. Opening Scholarly Documents through Text Analytics. In
Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (Cologne,
Germany) (JCDL 22). Association for Computing Machinery, New York, NY, USA,
Article 47, 2 pages. https://doi.org/10.1145/3529372.3530948
-
Building A Large Collection of Multi-domain Electronic Theses and Dissertations. 2021PDF
S. Uddin, B. Banerjee, J. Wu, W. A. Ingram and E. A. Fox, "Building A Large Collection of Multi-domain Electronic Theses and Dissertations," 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 2021, pp. 6043-6045, doi: 10.1109/BigData52589.2021.9672058.