Oral (3:30-4:20 PM)
- AI Alignment at Your Discretion
(Best Paper Award! 🏆🎉🎉)
- CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners
- Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
(Outstanding Paper Award! 🏆)
- Language Models use Lookbacks to Track Beliefs
- How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
(Outstanding Paper Award! 🏆)
Poster (12:50-2:10 PM)
Listed in random order.
- Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models
- What's Producible May Not Be Reachable: Measuring the Steerability of Generative Models
- Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
- TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
- An LSTM language model learns Hindi-Urdu case-agreement interactions, and has a linear encoding of case
- A Probabilistic Inference Approach to LLM Inference-Time Scaling
- A Taxonomy of Transcendence
- Are Foundation Models Foundational? Synthetic Tasks Reveal World Models
- Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases
- Boosting Large Language Models with Mask Fine-Tuning
- Building A Unified AI-centric Language System: analysis, framework and future work
- Can model interpretations predict behavior on unseen data?
- Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm
- ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
- Chunk-Distilled Language Modeling
- Classical Computation in Connectionist Models
- CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance
- Communication Makes Perfect: Persuasion Dataset Construction via Multi-LLM Communication
- Contextual morphologically-guided tokenization for pretrained Latin BERT models
- Continued Pre-training LLMs to Learn Simulated Knowledge Updates
- Do Automatic Factuality Metrics Measure Factuality?
- Escaping Collapse: The Strength of Weak Data for Large Language Model Training
- Explaining GPT-4's Schema of Depression Using Machine Behavior Analysis
- Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation
- Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling
- Focus Directions Make Your Language Models Pay More Attention to Relevant Contexts
- Generating Text from Uniform Meaning Representation
- HYBRIDMIND: Meta Selection of Natural Language and Symbolic Language for Enhanced LLM Reasoning
- In Search of Lost Language Model Training Dynamics
- In-Context Learning of Representations
- Inductive Linguistic Reasoning with Large Language Models
- Is analogy enough to draw novel adjective-noun inferences?
- JumpStarter: A Multi-Agent System for Getting Started on Personal Goals via Adaptive Personal Context Curation
- K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction
- KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students
- LLMs can Perform Multi-Dimensional Analytic Writing Assessments: A Case Study of L2 Graduate-Level Academic English Writing
- Loss in the Crowd: Hidden Breakthroughs in LM Training
- Mind the Gap: Assessing Crowd-Sourced Linguistic Knowledge on Morphological Gaps of Two Related Languages
- Name of Thrones: Evaluating How LLMs Rank Student Names, Race, and Gender in Status Hierarchies
- NüshuRescue: Reviving the Endangered Nüshu Language with AI
- Performing Scientific Research with Artificial Intelligence Researcher: A Comprehensive Study with Expert-Involved Evaluation
- Planetarium🪐: A Rigorous Benchmark for Translating Text to Structured Planning Languages
- Potemkin Understanding in Large Language Models: Formalizing and Benchmarking Conceptual Comprehension
- Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
- Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction
- Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
- PUPA: Private User Prompt Annotations Benchmark
- Re-Evaluating Evaluation for Multilingual Summarization
- Scaling Makes It Possible: How Large Models Master Impossible Languages
- Scaling Sparse and Dense Retrieval in Decoder-Only LLMs
- Scaling the Wall: Scaling Vanilla RNNs by Stealing Transformer Geometry
- Self-Steering Language Models
- Sociolinguistic Simulacra: Interactions Between Language and Attitudes in Finetuned Language Models
- Steering Fine Tuning with Targeted Concept Ablation
- Superpower🦸⚡️ of the Contrastive Decoding📈 comes from its Imagination🧠💡!
- Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
- Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
- TextArena: Beyond Traditional Benchmarks - Evaluating Social Intelligence in Language Models
- The Same but Different: Structural Similarities and Differences in Multilingual Language Modeling
- The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
- Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Datasets
- Working Memory Identifies Reasoning Limits in Language Models
- Auto-encoding Scientific Conclusions for Hypothesis Generation
- (How) Do Language Models Track State?
- A Systematic Evaluation of Transformer-LM Representations for Capturing Author States and Traits
- Accelerating robust in-context language learning
- Controlling Factual Associations and Visual Perception in Vision-Language Models
- Discovering Forbidden Topics in Language Models
- Do LLMs synthesize technical information like humans?
- Early Detection of Mild Cognitive Impairment Through Voice Assistant Interactions: An LLM-Driven Approach
- Evolutionary Dynamics of Syntax and Semantics in BERT: A Hyperbolic Geometry Perspective
- Exploring the Emergence of Shared Multilingual Concept Representations in LLMs
- First things first: Universal path dependence of learning
- ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences
- Investigating the Knowledge-Perception Trade-off in Vision-Language Models through Visual Counterfactuals
- Modeling Noisy-Channel Language Processing with Reanalysis of Possible Errors: A Probabilistic Inference Approach
- Paths Not Taken: Optimize Multilingual Factual Recall Pathways via Simple Zero-Rank Interventions
- Reasoning-based Regression: Teaching Language Models to Score Natural Language Features
- Supporting Biomedical Discovery with Human Agent Collaboration for Literature Grounded Search and Reasoning
- The Dual-Route Model of Induction
- The Role of PropBank Sense Numbers in AMR-to-text Generation and Text-to-AMR Parsing
- ThoughtCoder: Structured and adaptive problem solving via language model programming
- Reviving Endangered and Extinct Languages with Large Language Models
- United We Stand: Multi-LLM Collaboration for Advancing Scientific Research
- What's Hidden in Flemish Stories: How Can LLMs Unveil the Affective Nuances in Daily Narratives?