| 19/12/25 |
Gaifan |
|
| 12/12/25 |
Tulika |
|
| 05/12/25 |
Mike |
|
| 28/11/25 |
Procheta |
|
| 21/11/25 |
Danushka |
|
| 14/11/25 |
Huda |
|
| 07/11/25 |
Cancelled due to EMNLP 2025 in Suzhou |
|
| 31/10/25 |
Tianhui |
Jointly Reinforcing Diversity and Quality in Language Model Generations |
| 24/10/25 |
Lingfang |
Understanding Dataset Difficulty with v-Usable Information |
| 17/10/25 |
Gaifan |
Token Prepending: A Training-Free Approach for Eliciting Better Sentence Embeddings from LLMs |
| 10/10/25 |
Jack |
Argumentative Large Language Models for Explainable and Contestable Claim Verification |
| 03/10/25 |
Mike |
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation |
| 26/09/25 |
|
Cancelled due to ICLR/EACL submission DL |
| 19/09/25 |
Procheta |
Sparse Autoencoders Find Highly Interpretable Features in Language Models |
| 12/09/25 |
Danushka |
A New Formulation of Zipf’s Meaning-Frequency Law through Contextual Diversity |
| 05/09/25 |
Huda |
GNN-RAG: Graph Neural Retrieval for Efficient Large Language Model Reasoning on Knowledge Graphs |
| 25/07/25 |
Cancelled due to ACL 2025 in Vienna |
|
| 18/07/25 |
Tianhui |
Lexical Diversity-aware Relevance Assessment for Retrieval-Augmented Generation |
| 11/07/25 |
Lingfang |
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study |
| 04/07/25 |
Danushka |
Mapping 1,000+ Language Models via the Log-Likelihood Vector |
| 27/06/25 |
Gaifan |
A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens |
| 20/06/25 |
Procheta |
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs |
| 13/06/25 |
Mike |
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks |
| 06/06/25 |
Tulika |
Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models |
| 30/05/25 |
Huda |
PathRAG: Pruning Graph-based Retrieval Augmented Generation with Relational Path |
| 16/05/25 |
Tianhui |
Learning Dynamics of LLM fine-tuning |
| 02/05/25 |
Jodie |
Investigating Bias in LLM-Based Bias Detection: Disparities between LLMs and Human Perception |
| 11/04/25 |
Lingfang |
Reducing Hallucinations in Vision-Language Models via Latent Space Steering |
| 04/04/25 |
Danushka |
Circuit Tracing: Revealing Computational Graphs in Language Models |
| 28/03/25 |
Gaifan |
Measuring Implicit Bias in Explicitly Unbiased Large Language Models |
| 21/03/25 |
Procheta |
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs |
| 14/03/25 |
Huda |
Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains |
| 07/03/25 |
Tulika |
Mitigating large language model hallucinations via autonomous knowledge graph-based retrofitting |
| 28/02/25 |
Mike |
Reasons to Reject? Aligning Language Models with Judgments |
| 21/02/25 |
Tianhui |
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model |
| 07/02/25 |
Jack |
LegalReasoner: A Multi-Stage Framework for Legal Judgment Prediction via Large Language Models and Knowledge Integration |
| 24/01/25 |
Jodie |
Evaluating Language Models as Synthetic Data Generators |
| 17/01/25 |
Lingfang |
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference |
| 10/01/25 |
Danushka |
Linguistically Conditioned Semantic Textual Similarity |
| 20/12/24 |
Huda |
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering |
| 13/12/24 |
Masaru Isonuma |
Controlling Training Data for Model Interpretability and Safety |
| 06/12/24 |
Procheta |
Finding Transformer Circuits with Edge Pruning |
| 29/11/24 |
Taichi |
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models |
| 22/11/24 |
Zhidong Ling |
Construction of Evaluation Dataset for Japanese Semantic Change Detection |
| 15/11/24 |
Gaifan |
Understanding Higher-Order Correlations Among Semantic Components in Embeddings |
| 08/11/24 |
Danushka |
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions |
| 01/11/24 |
Mike |
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! |
| 25/10/24 |
Tulika |
Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Agent |
| 18/10/24 |
Tianhui |
Commonsense Knowledge Editing Based on Free-Text in LLMs |
| 11/10/24 |
Lingfang |
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small |
| 04/10/24 |
Jodie |
Debiasing Vision-Language Models via Biased Prompts |
| 27/09/24 |
Jack |
“I’d Like to Have an Argument, Please”: Argumentative Reasoning in Large Language Models |
| 20/09/24 |
Huda |
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity |
| 13/09/24 |
Procheta |
Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models |
| 06/09/24 |
Danushka |
Mission Impossible Language Models |
| 26/07/24 |
Mike |
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems |
| 19/07/24 |
Tulika |
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization |
| 12/07/24 |
Tianhui |
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations |
| 28/06/24 |
Lingfang |
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions |
| 21/06/24 |
Jodie |
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark |
| 14/06/24 |
No meeting due to EMNLP submission deadline |
|
| 07/06/24 |
Jack |
Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation |
| 31/05/24 |
Huda |
ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base |
| 24/05/24 |
Procheta |
Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small |
| 17/05/24 |
Danushka |
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements |
| 10/05/24 |
No meeting due to PhD workshop |
|
| 03/05/24 |
Mike |
KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction |
| 26/04/24 |
No meeting due to NLP4SocialGood Symposium |
|
| 19/04/24 |
Tianhui |
How Easily do Irrelevant Inputs Skew the Responses of Large Language Models? |
| 12/04/24 |
Tulika |
DisorBERT: A Double Domain Adaptation Model for Detecting Signs of Mental Disorders in Social Media |
| 05/04/24 |
Lingfang |
Learning to Explain: An Information-Theoretic Perspective on Model Interpretation |
| 22/03/24 |
Tianhui |
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation |
| 15/03/24 |
Jodie |
Evaluating Bias and Fairness in Gender-Neutral Pretrained Vision-and-Language Models |
| 08/03/24 |
Jack |
From Text to Structure: Using Large Language Models to Support the Development of Legal Expert Systems |
| 01/03/24 |
Danushka |
In-Contextual Gender Bias Suppression for Large Language Models |
| 23/02/24 |
Junjie |
Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times? |
| 09/02/24 |
Huda |
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization |
| 02/02/24 |
Procheta |
Exploiting Language Characteristics for Legal Domain-Specific Language Model Pretraining |
| 26/01/24 |
Mike |
G-Eval: NLG Evaluation using Gpt-4 with Better Human Alignment |
| 19/01/24 |
Tulika |
Towards Understanding Omission in Dialogue Summarization |
| 12/01/24 |
Saleem |
Towards Interpretable Mental Health Analysis LLMs |
| 15/12/23 |
Danushka, Meng, Jodie |
EMNLP-23 overview |
| 08/12/23 |
No meeting due to EMNLP |
|
| 01/12/23 |
Lingfang |
Interpreting Language Models with Contrastive Explanations |
| 24/11/23 |
Danushka |
An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels |
| 17/11/23 |
Tianhui |
IJCNLP-AACL 2023 Overview |
| 10/11/23 |
Jodie |
Assessing Multilingual Fairness in Pre-trained Multimodal Representations |
| 03/11/23 |
No meeting due to IJCNLP-AACL |
|
| 27/10/23 |
Danushka |
Discovering Universal Geometry in Embeddings with ICA |
| 20/10/23 |
Huda |
Direct Fact Retrieval from Knowledge Graphs without Entity Linking |
| 13/10/23 |
Tianhui |
|
| 10/06/23 |
Tulika |
|
| 29/09/23 |
Procheta |
Self-Refine: Iterative Refinement with Self-Feedback |
| 22/09/23 |
Mike |
Should You Mask 15% in Masked Language Modeling? |
| 14/09/23 |
Jodie |
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models |
| 07/09/23 |
Danushka |
XL-LEXEME: WiC Pretrained Model for Cross-Lingual LEXical sEMantic changE |
| 20/07/23 |
Danushka (ACL update) |
|
| 13/07/23 |
No meeting (ACL@Toronto) |
|
| 06/07/23 |
Huda |
Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling. poster and presentation |
| 29/06/23 |
Tianhui |
LoRA: Low-Rank Adaptation of Large Language Models(https://arxiv.org/abs/2106.09685) |
| 22/06/23 |
No meeting (EMNLP preparations) |
|
| 15/06/23 |
Procheta |
Tutorial rehersal |
| 08/06/23 |
Skipped due to NLP Symposium |
|
| 01/06/23 |
Danushka |
Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts |
| 25/05/23 |
Mike |
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models |
| 18/05/23 |
Jodie |
We’re Afraid Language Models Aren’t Modeling Ambiguity |
| 11/05/23 |
Huda |
Decoupling Mixture-of-Graphs: Unseen Relational Learning for Knowledge Graph Completion by Fusing Ontology and Textual Experts |
| 04/05/23 |
Tianhui |
Multimodal Chain-of-Thought Reasoning in Language Models |
| 27/04/23 |
Danushka |
Generated Knowledge Prompting for Commonsense Reasoning |
| 20/04/23 |
Procheta |
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier |
| 13/04/23 |
Tulika |
ECIR conference update |
| 06/04/23 |
Mike |
Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion |
| 30/03/23 |
Jodie |
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models |
| 23/03/23 |
Huda |
Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach |
| 16/03/23 |
Xiaohang |
Time-Aware Language Models as Temporal Knowledge Bases |
| 09/03/23 |
Gaifan |
Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective Distillation |
| 02/03/23 |
Tianhui |
Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning |
| 16/02/23 |
Tulika |
Aspect-Controllable Opinion Summarization |
| 09/02/23 |
Procheta |
Dialog Inpainting: Turning Documents into Dialogs |
| 02/02/23 |
Danushka |
Accelerating Large-Scale Inference with Anisotropic Vector Quantization |
| 26/01/23 |
Jodie |
Retrieval Augmentation for Commonsense Reasoning: A Unified Approach |
| 19/01/23 |
Huda |
FPC: Fine-tuning with Prompt Curriculum for Relation Extraction |
| 05/01/23 |
Mike |
Generative Knowledge Graph Construction: A Review |
| 22/12/22 |
Tulika |
Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays |
| 15/12/22 |
Jodie and Danushka |
EMNLP 2022 Summary |
| 01/12/22 |
Jodie |
ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding |
| 24/11/22 |
Aida |
MelBERT: Metaphor Detection via Contextualized Late Interaction using Metaphorical Identification Theories |
| 17/11/22 |
Huda |
Deep Bidirectional Language-Knowledge Graph Pretraining |
| 10/11/22 |
Danushka |
Learning To Retrieve Prompts for In-Context Learning |
| 03/11/22 |
Tulika |
Dr. Can See: Towards a Multi-modal Disease Diagnosis Virtual Assistant |
| 27/10/22 |
Mike |
An End-to-end Model for Entity-level Relation Extraction using Multi-instance Learning |
| 13/10/22 |
Tianhui |
Enhancing Topic-to-Essay Generation with External Commonsense Knowledge |
| 06/10/22 |
Huda |
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing |
| 29/09/22 |
Procheta |
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation |
| 22/09/22 |
Danushka |
IR like a SIR: Sense-enhanced Information Retrieval for Multiple Languages |
| 08/09/22 |
Danushka |
Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings |
| 23/08/22 |
Jodie |
MetaICL: Learning to Learn In Context |
| 02/08/22 |
Huda |
On Transferability of Prompt Tuning for Natural Language Processing and Learning to Transfer Prompts for Text Generation |
| 26/07/22 |
Michael |
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks |
| 12/07/22 |
Danushka |
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity |
| 05/07/22 |
Angrosh |
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization |
| 28/06/22 |
Jodie |
Nibbling at the Hard Core of Word Sense Disambiguation |
| 14/06/22 |
Huda |
Learning to Borrow – Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion |
| 07/06/22 |
Michael |
Meta-learning via Language Model In-context Tuning |
| 17/05/22 |
Danushka |
ACL summary |
| 10/05/22 |
Angrosh |
Label Verbalization and Entailment for Effective Zero- and Few-Shot Relation Extraction |
| 26/05/22 |
Huda |
Learning to Borrow – Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion |
| 19/04/22 |
Jodie |
Sense Embeddings are also Biased–Evaluating Social Biases in Static and Contextualised Sense Embeddings |
| 12/04/22 |
Michael |
Position-based Prompting for Health Outcome Generation |
| 05/04/22 |
Danushka |
Generating Datasets with Pretrained Language Models |
| 29/03/22 |
Huda |
Exploring Task Difficulty for Few-Shot Relation Extraction |
| 22/03/22 |
Jodie |
Word sense disambiguation: Towards interactive context exploitation from both word and sense perspectives |
| 15/03/22 |
Michael |
CROSSFIT : A Few-shot Learning Challenge for Cosstask generalisation in NLP |
| 08/03/22 |
Danushka |
PairRE: Knowledge Graph Embeddings via Paired Relation Vectors |
| 01/03/22 |
Angrosh |
Relation Classification with Entity Type Restriction |
| 22/02/22 |
Huda |
Entity Concept-enhanced Few-shot Relation Extraction |
| 15/02/22 |
Danushka |
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning |
| 08/02/22 |
Michael |
Making Pre-trained Language Models Better Few-shot Learners |
| 02/02/22 |
Jodie |
ESC: Redesigning WSD with Extractive Sense Comprehension |
| 26/01/22 |
Huda |
CoLAKE: Contextualized Language and Knowledge Embedding |
| 19/01/22 |
Danushka |
Knowlege base completion meets transfer learning |
| 24/11/21 |
Jodie |
ConSeC: Word Sense Disambiguation as Continuous Sense Comprehension |
| 17/11/21 |
Micheal |
When does Further Pre-training MLM Help? An Empirical Study on Task-Oriented Dialog Pre-training |
| 3/11/21 |
Angrosh |
Graph Transformer Networks |
| 27/10/21 |
Huda |
Distilling Relation Embeddings from Pre-trained Language Models |
| 20/10/21 |
Danushka |
Dynamic contextualised word embeddings |
| 13/10/21 |
Jodie |
MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models |
| 06/10/21 |
Micheal |
AUTOPROMPT: Eliciting Knowledge from Language Models with Automatically Generated Prompts |
| 29/09/21 |
James |
Does knowledge distillation really work? |
| 22/09/21 |
Huda |
ZS-BERT: Towards Zero-Shot Relation Extraction with Attribute Representation Learning |
| 13/09/21 |
Angrosh |
Dependency-driven RE with Attentive Graph Convolutional Networks |
| 08/09/21 |
Danushka |
Is sparse attention interpretable |
| 01/09/21 |
Jodie |
Sparsity Makes Sense: Word Sense Disambiguation Using Sparse Contextualized Word Representations |
| 25/08/21 |
Micheal |
Prefix-Tuning: Optimizing Continuous Prompts for Generation |
| 18/08/21 |
James |
Lookahead: A far-sighted alternative of magnitude-based pruning |
| 11/08/21 |
Huda |
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning |
| 04/08/21 |
Angrosh |
VGCN-BERT: Augmenting BERT with Graph Embedding for Text Classification |