Editing Transformer Models with Common Sense Knowledge (EMNLP Conference, Dec. 2023)

  • Post category:Projects

Editing Transformer Models with Common Sense Knowledge (EMNLP Conference, Dec. 2023)Partner: Allen Institute for AIParticipants: Anshita Gupta, Debanjan Mondal, Akshay Krishna SheshadriDescription: Memory editing for updating encyclopedic knowledge in transformers has received increasing attention, but it is unclear if these methods can be adapted for nuanced common sense knowledge. In this research, we proposed an adaptation of MEMIT to edit common sense mistakes in GPT-2 Large and XL. We extend editing to various token locations and employ a robust layer selection strategy. Our results suggest a promising path for improving GPT by incorporating context-specific user feedback about common sense through direct model editing as well as fixing and customizing model behaviors using human-in-the-loop-systems.

Continue ReadingEditing Transformer Models with Common Sense Knowledge (EMNLP Conference, Dec. 2023)

Generating Metrics for High-Performance Computing Clusters

  • Post category:Projects

Partner: Unity DS4CG 2023. Unity is a collaborative, multi-institutional high-performance computing cluster, primarily used for research computing. The Unity project focused on generating useful metrics and analysis for Unity by building a pipeline to a database that could power a live dashboard for Unity’s admin staff. Metrics included unnecessarily idle GPUs, daily and weekly node usage, total resource usage, and wait time. Additionally, a prediction model for wait time at job submission time was built.

Continue ReadingGenerating Metrics for High-Performance Computing Clusters

Detecting Extreme Speech in YouTube Videos

  • Post category:Projects

Partner: Media Cloud The surge in multimodal content shared online, particularly on platforms like YouTube and Instagram, has increased the need for effective extreme and hateful speech detection systems. Current systems often fail to address the nuanced challenges of detecting explicit and implicit hate speech in multimodal contexts, where speech and text combine to convey harmful messages. Media Cloud, an open-source media research platform, helps researchers study news and information flow globally. This DS4CG team worked in collaboration with Media Cloud to focus on advancing multimodal hate speech detection by addressing three key challenges: the lack of comprehensive, human-annotated datasets; the absence of systems capable of analyzing both audio and text data simultaneously; and the need for fine-grained detection of subtle hate speech. The study leverages distinct latent features from audio and text to improve…

Continue ReadingDetecting Extreme Speech in YouTube Videos

Extracting Bylines from Media in Multiple Languages

  • Post category:Projects

Partner: Media Cloud Over the past few decades, the rapid expansion of digital media has transformed how information is shared and consumed. However, this growth presents challenges such as content moderation, misinformation detection, and addressing media bias. Categorizing articles by authors or agencies has become a critical step in tackling these issues, especially for both high- and low-resource settings. This DS4CG project evaluated existing and newly implemented tools for extracting author names from news articles. Using Media Cloud’s article archive, 100 documents from 10 languages were sampled and annotated by volunteers fluent in each language, following guidelines developed with the Media Cloud team. A pipeline was designed to test these tools, and their performance was assessed using five NLP metrics.

Continue ReadingExtracting Bylines from Media in Multiple Languages

Analyzing Energy Usage with Predictive Modeling

  • Post category:Projects

Partner: Massachusetts Division of Capital Asset Management & Maintenance DS4CG 2023. DCAMM is responsible for managing resources in various state buildings like state hospitals, prisons, universities, community colleges, office buildings. This project analyzed 5 year energy usage of 279 utility meters in 23 academic buildings. Using this data, time-series prediction models were developed for 12-month energy consumption of various utilities (electricity, steam, natural gas, water) by building. Prediction is the first step towards data-driven efficient management of energy resources and energy conservation.

Continue ReadingAnalyzing Energy Usage with Predictive Modeling

Analysis of #StopAsianHate on Twitter

  • Post category:Projects

Partner: Co-Insights In partnership with Co-Insights, this DS4CG project explored the #StopAsianHate movement using NLP to analyze topic transitions, identify significant events, and highlight key accounts driving conversations. Unlike prior studies focused on peak activity, this longitudinal analysis examined changes over time, implementing text embedding and clustering models to uncover frequent unigrams, phrases, and example tweets.

Continue ReadingAnalysis of #StopAsianHate on Twitter