@everyone We've had a number of papers accepted for publication recently, including both previously released ones and new ones. In lieu of many posts today, I figured batching them probably makes more sense ð I've provided some annotations to help guide which might be new to you.
Apologies if I'm missing your tag or paper! Please DM me with corrections.
EMNLP
RWKV: Reinventing RNNs for the Transformer Era (Findings) by @blinkdl @hypnopump @Quentin Anthony, et al.
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback by @alexhavrilla @maxreciprocate @duyphung.ai @1.69 @Ryan Gosling @Stella Biderman (she/her) @Quentin Anthony Ethan Kim and @ihateihatelouis paper forthcoming
NeurIPS
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs (spotlight) by @Laura Ruis Akbir Khan, @Stella Biderman (she/her), @sara_hooker, Tim RocktÀschel, and Edward Grefenstette major paper update
LEACE: Perfect linear concept erasure in closed form by @norabelrose @dsj @shauli_ @rcotterell @edwardraff @Stella Biderman (she/her)
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors by Scotti et al. incl @ilovescience (spotlight) previously unannounced
Emergent and Predictable Memorization in Large Language Models by @Stella Biderman (she/her) @Orz @lintangsutawika @Hailey Schoelkopf @Quentin Anthony @Ryan Gosling and @edwardraff
Math-AI Workshop
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text by @keirp @dsantosmarco @zhangir_azerbayev and Jimmy Ba
Llemma: An Open Language Model For Mathematics by @zhangir_azerbayev @Hailey Schoelkopf @keirp@dsantosmarco @mcaleste @Albert Jiang @jianga2718@Stella Biderman (she/her) @wellecks
Socially Responsible Language Modelling Research Workshop
Eliciting Language Model Behaviors using Reverse Language Models (spotlight) by Pfau et al. incl. @alexinfanger and @ai_waifunew paper
@Stella Biderman (she/her) has an invited panel.
Workshop on Backdoors in Deep Learning
Detecting Backdoors with Meta-Models by Langosco et al. incl. @Hyperion new paper
__Workshop on Attributing Model Behavior at Scale (ATTRIB) __
Sparse Autoencoders Find Highly Interpretable Features in Language Models by @hoagy @aidan ewart @loganriggs @Robert_AIZI @leesharkey
Apache Cassandra Conference
@picocreator has an invited talk on RWKV
Nature
Roleplay with Large Language Models by Murray Shanahan and @repligate previously unannounced
Preprints
The OpenELM Library by @Hyperion @Honglu Ryan Zhou, Daniel Scott, and @joellehman new paper
Meet-ups
If you want to meet up with EleutherAI check out #1171809525280550932 #1171291697561477170 #1182032181921587200 respectively.