Manuscripts
Substance Beats Style: Why Beginning Students Fail to Code with LLMs
Francesca Lucchetti, Zixuan Wu, Arjun Guha, Molly Q Feldman, and Carolyn Jane Anderson.
Preprint
GlyphPattern: An Abstract Pattern Recognition Benchmark for Vision-Language Models
Zixuan Wu, Yoolim Kim, and Carolyn Jane Anderson.
Preprint
Untangling classes of context-sensitivity: a closer look at the semantics of American English tomorrow.
Carolyn Jane Anderson.
2019 draft on LingBuzz
The andative and venitive construction in San Lucas Quiaviní Zapotec.
Carolyn Jane Anderson. 2017. Ms.
Draft on LingBuzz
StarCoder 2 and The Stack v2: The Next Generation
Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sebastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries.
Preprint
2024
Evaluating Computational Representations of Character: An Austen Character Similarity Benchmark
Funing Yang and Carolyn Jane Anderson. Accepted to the 4th International Workshop on Natural Language Processing for Digital Humanities (NLP4DH) at EMNLP 2024. Selected for oral presentation.
Preprint
Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
Federico Cassano, Luisa Li, Akul Sethi, Noah Shinn, Abby Brennan-Jones, Jacob Ginesin, Edward Berman, George Chakhnashvili, Anton Lozhkov, Carolyn Jane Anderson, Arjun Guha. Accepted to COLM 2024.
Preprint
What Parenthesized Modifiers (May) Mean
Yoolim Kim and Carolyn Jane Anderson. Proceedings of Experiments in Linguistic Meaning (ELM) 3.
Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs
Federico Cassano, John Gouwar, Francesca Lucchetti, Claire Schlesinger, Anders Freeman, Carolyn Jane Anderson, Molly Q Feldman, Michael Greenberg, Abhinav Jangda, Arjun Guha. Accepted to OOPSLA 2024.
Preprint
Exploring Language Representation through a Resource Inventory Project
Carolyn Jane Anderson. Teaching Resource accepted to TeachNLP Workshop at ACL 2024.
Preprint
A Prompting Assignment for Exploring Pretrained LLMs
Carolyn Jane Anderson. Teaching Resource accepted to TeachNLP Workshop at ACL 2024.
Preprint
StudentEval: a Benchmark of Student-Written Prompts for Large Language Models of Code
Hannah Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, and Carolyn Jane Anderson. Findings of the Association for Computational Linguistics 2024.
arXiv draft
Preprint
HuggingFace dataset
Non-Expert Programmers in the Generative AI Future
Molly Q Feldman and Carolyn Jane Anderson. Accepted to CHIWORK 2024.
Preprint
How Beginning Programmers and Code LLMs (Mis)read Each Other
Sydney Nguyen, Hannah Babe, Yangtian Zi, Arjun Guha, Carolyn Jane Anderson, and Molly Q Feldman. Accepted to CHI 2024.
Paper
2023
StarCoder: May the Source Be With You!
Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Randy, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Benjamin Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy, Jason Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Nour Fahmy, Urvashi Bhattacharyya, Suriya Gunasekar, Wenhao Yu, Swayam Singh, Sasha Luccioni, Paulo Villegas, Maxim Kunakov, Fedor Zhdanov, Manuel Romero, Tony Lee, Nadav Timor, Jennifer Ding, Claire Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries. Accepted to Transactions on Machine Learning Research
Preprint
Protagonist-mediated perspective
Carolyn Jane Anderson and Arjun Guha. Proceedings of Sinn und Bedeutung (SuB) 28.
Preprint
Solving and Generating NPR Sunday Puzzles with Large Language Models
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation
Do All Minority Languages Look the Same to Chat-GPT? Linguistic (Mis)information in a Large Language Model.
Sydney Nguyen and Carolyn Jane Anderson. Poster presented at the Society for Computation in Linguistics (SCiL) 2023.
Cross-linguistic differences in processing parentheticals between English and Korean.
Yoolim Kim and Carolyn Jane Anderson. Accepted for presentation at Comparative Punctuation Worldwide.
SantaCoder: Don’t Reach For the Stars!
Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Terry Yue Zhuo, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Michael Lappert, Ian Yu, Paulo Villegas, Jia Li, David Lansky, Huu Nguyen, Danish Contractor, Luis Villa, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Sean Hughes, Arjun Guha, Harm de Vries, Leandro von Werra.
Best Paper Award at the Deep Learning 4 Code (DL4C) workshop. Draft
Grammatical perspective-taking in comprehension and production.
Carolyn Jane Anderson and Brian Dillon. Open Mind.
Paper
Exploring Social Biases of Large Language Models in a College Artificial Intelligence Course
Skylar Kolisko and Carolyn Jane Anderson. Proceedings of the Thirteenth Symposium on Educational Advances in Artificial Intelligence (EAAI-23).
Preprint
2022
Eliciting Associated Motion Constructions in Two Zapotec Languages
Fe Silva-Robles, Felipe H. Lopez, John Duff, and Carolyn Jane Anderson. Semantic Fieldwork Methods
Protagonist-Mediated Perspective
Carolyn Jane Anderson. Talk given at the Narration in Context workshop at the Deutsche Gesellschaft für Sprachwissenschaft (DGfS), 2022.
(Some) parentheses are focus-sensitive operators
Carina Bolaños Lewen and Carolyn Jane Anderson. Proceedings of Sinn und Bedeutung (SuB) 26.
Paper
2021
ProSPer: Probing Human and Neural Network Language Model Understanding of Spatial Perspective.
Tessa Masis and Carolyn Jane Anderson. Accepted to the BlackboxNLP workshop at the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2021.
Preprint
Solver-based Gradual Type Migration.
Luna Phipps-Costin, Carolyn Jane Anderson, Michael Greenberg, and Arjun Guha. Accepted to the ACM SIGPLAN Conference on Object Oriented Programming, Systems, Languages and Applications (OOPSLA) 2021.
Tell Me Everything You Know: A Conversation Update System for the Rational Speech Acts Framework
Carolyn Jane Anderson. Proceedings of the Society for Computation in Linguistics (SCiL) 2021.
Paper
Coming in, or going out? Measuring the effect of discourse factors on perspective prominence
Diagnosing the semantics of perspectival expressions
Carolyn Jane Anderson. Poster presented at the annual meeting of the Linguistic Society of America (LSA) 2021.
Abstract
2020
Shifting the Perspectival Landscape: Methods for Encoding, Identifying, and Selecting Perspectives.
Carolyn Jane Anderson. Dissertation, University of Massachusetts, Amherst.
LingBuzz
Can neural network language models understand spatial perspective?
Carolyn Jane Anderson and Tessa Masis. Paper presented at Bridging AI and Cognitive Science (BAICS), at the International Conference on Learning Representations (ICLR) 2020.
Non-archival paper
2019
Guess Who's Coming (And Who's Going): Bringing Perspective to the Rational Speech Acts Framework.
Carolyn Jane Anderson and Brian Dillon. Proceedings of the Society for Computation in Linguistics (SCiL) 2019.
Paper Poster
"Tomorrow" Isn't Always A Day Away.
Taking other perspectives into account: an RSA model of perspectival reasoning.
Carolyn Jane Anderson and Brian Dillon. Talk given at Rational Approaches in Language Science (RAiLS) 2019.
Explaining the progressive motion verb puzzle in Zapotec.
Carolyn Jane Anderson. Talk given at the Texas Linguistics Society 2019.
Slides
2018
"Tomorrow" Isn't Always A Day Away.
Carolyn Jane Anderson. Poster presented at the 31st annual CUNY Human Sentence Processing Conference (CUNY) 2018.
Abstract
The San Lucas Quiaviní Zapotec Andative and Venitive.
2017
The Andative and Venitive Construction in San Lucas Quiaviní Zapotec.
2016
Negation in Colonial Valley Zapotec.
Carolyn Jane Anderson and Brook Danielle Lillehaugen. Transactions of the Philological Society 114(3).
2015
The Morphosyntax of Negation in Colonial Valley Zapotec.
2014
NetKAT: Semantic Foundations for Networks.
Carolyn J. Anderson, Nate Foster, Arjun Guha, Jean-Baptiste Jeannin, Dexter Kozen, Cole Schlesinger, and David Walker. ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL) 2014.
PDF Slides
La morfosintaxis de la negación en el zapoteco del Valle colonial. [The morphosyntax of negation in Colonial Valley Zapotec.]
Carolyn Jane Anderson and Brook Danielle Lillehaugen. Talk presented at Coloquio sobre Lenguas Otomangues y Vecinas IV: Mario Molina Cruz (COLOV) 2014.
Abstract
"I talk it and I feel it": Language attitudes of Moroccan university students
Carolyn Jane Anderson. Honors thesis, Swarthmore College.
2013
Language Ideology and Human Rights Doctrine in Morocco.
Carolyn Jane Anderson. Talk presented at New Ways of Analyzing Variation (NWAV) 42.