Generative AI has invaded our places of work and learning with the promise of increasing productivity.
However, many generative AIs are built on (Large Language Model) LLMs which act as next-wordpredictors
based on probabilistic modeling. This leads to numerous challenges, especially ambiguity.
This proposal addresses the research question: How can we reduce ambiguity in AI generated text?
The current proposal seeks to 1) identify ways to algorithmically identify and flag ambiguity, and 2)
explore identifying levels of ambiguity and 3) explore ways in which ambiguity could be reduced or
managed.
Once ambiguity is identified, we intend to use a LLM application to generate improved alternatives. This
project will help improve the quality of human interactions with AI applications such as chatbots.
Project Information Subsection
{Empty}
{Empty}
{Empty}
{Empty}
Practical applications
{Empty}
{Empty}
{Empty}
CR-Rutgers
{Empty}
Yes
Already behind3Start date is flexible
6
{Empty}
{Empty}
{Empty}
{Empty}
Milestone Title: Launch Milestone Description: Give a launch presentation during the monthly meeting get HPC access and explore and validate multiple LLMs on ready to use datasets Completion Date Goal: 2024-04-19 Actual Completion Date: 2024-05-31
Milestone Title: Identify ambiguity Milestone Description: Narrow down on most promising approaches to identify ambiguity and run tests Completion Date Goal: 2024-06-01 Actual Completion Date: 2024-07-15
Milestone Title: Finetuning Milestone Description: Apply Finetuning, and other customizations to the LLMs to generate suitable text Completion Date Goal: 2024-07-16 Actual Completion Date: 2024-08-31
Milestone Title: Finalize Documentation Milestone Description: Produce a workflow that Write a white paper, prepare presentation and package any other deliverable/s. Completion Date Goal: 2024-09-01 Actual Completion Date: 2024-10-01
Milestone Title: Wrap presentation Milestone Description: Give a wrap presentation at the monthly meeting and have an exit interview.
{Empty}
{Empty}
{Empty}
The student will gain familiarity with Rutgers' HPC system, Amarel, and understand how to run NLP analysis using Amarel.
{Empty}
Jupyter notebooks with examples on how to run NLP analysis.
Access to the Amarel cluster, Rutgers' HPC system.