About me

I am an Assistant Professor at Virginia Tech, where I work as an AI Research Scientist, Digital Libraries, in the University Libraries. I completed my Ph.D. at Virginia Tech in July 2024, advised by Dr. Edward A. Fox. My dissertation, titled "Improving Access to ETD Elements Through Chapter Categorization and Summarization," brings computational access to long documents such as Electronic Theses and Dissertations (ETDs) by providing readers with more granular metadata, such as chapter summaries and classification labels.

Currently, as an AI researcher, I build solutions that help draw better insights from the data in Virginia Tech's digital library. My research mainly focuses on natural language processing, data mining, machine learning, and digital libraries.

Research Interests

  • Natural Language Processing

  • Digital Libraries

  • Machine Learning

  • Large Language Models

  • Scholarly big data

  • Text summarization and Classification

  • Deep learning

Resume

Work Experience

  1. Virginia Tech

    August, 2024 - Present

    AI Research Scientist, Assistant Professor, University Libraries.

  2. Virginia Tech

    August, 2019 — August, 2024

    Graduate Research Assistant

  3. ADP

    May, 2019 — August, 2019

    Global Product and Technology Intern

  4. Virginia Tech

    January, 2019 - May, 2019

    Graduate Teaching Assistant

  5. Tata Consultancy Services

    April, 2016 - November, 2017

    Assistant Systems Engineer

Education

  1. Virginia Tech

    2019 — 2024

    Ph.D. in Computer Science

  2. Virginia Tech

    2022

    M.S. in Computer Science

  3. West Bengal University of Technology

    2015

    Bachelor of Technology in Computer Science and Applications

Research

Publications

  1. An Integrated Digital Library System for Long Documents and their Elements. 2023 PDF

    S. Chekuri, P. Chandrasekar, B. Banerjee, S. Park, N. Masrourisaadat, A. Ahuja, W. A. Ingram, J. Wu and E. A. Fox, "An Integrated Digital Library System for Long Documents and their Elements," 2023 ACM/IEEE Joint Conference on Digital Libraries.

  2. Applications of data analysis on scholarly long documents. 2022 PDF

    B. Banerjee, W. A. Ingram, J. Wu and E. A. Fox, "Applications of data analysis on scholarly long documents," 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 2473-2481, doi: 10.1109/BigData55660.2022.10020935.

  3. Opening scholarly documents through text analytics. 2022 PDF

    Bipasha Banerjee. 2022. Opening Scholarly Documents through Text Analytics. In Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (Cologne, Germany) (JCDL '22). Association for Computing Machinery, New York, NY, USA, Article 47, 2 pages. https://doi.org/10.1145/3529372.3530948

  4. Building A Large Collection of Multi-domain Electronic Theses and Dissertations. 2021 PDF

    S. Uddin, B. Banerjee, J. Wu, W. A. Ingram and E. A. Fox, "Building A Large Collection of Multi-domain Electronic Theses and Dissertations," 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 2021, pp. 6043-6045, doi: 10.1109/BigData52589.2021.9672058.

Presentations

  1. Help Me Help You - A Mixed-Initiative Approach To Explore Book-length Documents. 2022

    Talk presented at the CIKM 2022 Workshop on Human-in-the-loop Data Curation.

  2. Applications of mining ETDs. 2021

    Presented at the ETD 2021 conference.

  3. Extracting Information from Electronic Thesis and Dissertations. 2021

    Talk presented at the ACM Capital Region Celebration of Women (CAPWIC 2021).

Portfolio