About me

I am a Ph.D. candidate in Computer Science from Virginia Tech. I am part of the DLRL lab . My advisor is Dr. Edward A. Fox . My research mainly focuses on natural language processing, data mining and digital libraries.

My research speicifically focusses on bringing computational access to book-length documents such as electronic theses and dissertations (ETDs) by providing more granualar level access to the readers. Chapter-level summaries and classification labels can help readers and stakeholders find the portion of interest more efficiently.

I am on the job market and planning on graduating May 2024.

Research Interests

  • Natural Language Processing

  • Machine Learning

  • Large Language Models

  • Scholarly big data

  • Text summarization

  • Deep learning

Resume

Work Experience

  1. Virginia Tech

    August, 2019 — Present

    Graduate Research Assistant

  2. ADP

    May, 2019 — August, 2019

    Global Product and Technology Intern

  3. Virginia Tech

    January, 2019- May, 2019

    Graduate Teaching Assistant

  4. Tata Consultancy Services

    April, 2016- November, 2017

    Assistant Systems Engineer

Education

  1. Virginia Tech

    2019 — Present

    Ph.D. in Computer Science

  2. Virginia Tech

    2022

    M.S in Computer Science

  3. West Bengal University of Technology

    2015

    Bachelor of Technology in Computer Science and Applications

Research

Publications

  1. Integrated Digital Library System for Long Documents and their Elements. 2023 PDF

    S. Chekuri, P. Chandrasekar, B. Banerjee, S. Park, N. Masrourisaadat, A. Ahuja, W. A. Ingram, J. Wu and E. A. Fox, "An Integrated Digital Library System for Long Documents and their Elements," 2023 ACM/IEEE Joint Conference on Digital libraries".

  2. Applications of data analysis on scholarly long documents. 2022 PDF

    B. Banerjee, W. A. Ingram, J. Wu and E. A. Fox, "Applications of data analysis on scholarly long documents," 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 2473-2481, doi: 10.1109/BigData55660.2022.10020935.

  3. Opening scholarly documents through text analytics. 2022PDF

    Bipasha Banerjee. 2022. Opening Scholarly Documents through Text Analytics. In Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (Cologne, Germany) (JCDL 22). Association for Computing Machinery, New York, NY, USA, Article 47, 2 pages. https://doi.org/10.1145/3529372.3530948

  4. Building A Large Collection of Multi-domain Electronic Theses and Dissertations. 2021PDF

    S. Uddin, B. Banerjee, J. Wu, W. A. Ingram and E. A. Fox, "Building A Large Collection of Multi-domain Electronic Theses and Dissertations," 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 2021, pp. 6043-6045, doi: 10.1109/BigData52589.2021.9672058.

Presentations

  1. Help Me Help You - A Mixed-Initiative Approach To Explore Book-length Documents. 2022

    Talk presented at CIKM 2022 Workshop on Human-in-the-loop Data Curation

  2. Applications of mining ETDs. 2021

    Presented at ETD 2021 conference.

  3. Extracting Information from Electronic Thesis and Dissertations. 2021

    Talk presented at the ACM Capital Region Celebration of Women (CAPWIC 2021)

Portfolio