Intrinsic evaluation of unlearning using parametric knowledge traces Y Hong, L Yu, H Yang, S Ravfogel, M Geva arXiv preprint arXiv:2406.11614, 2024 | 12 | 2024 |
A computational analysis of crosslinguistic regularity in semantic change O Fugikawa, O Hayman, R Liu, L Yu, T Brochhagen, Y Xu Frontiers in Communication 8, 1136338, 2023 | 11 | 2023 |
Emergence of a high-dimensional abstraction phase in language transformers E Cheng, D Doimo, C Kervadec, I Macocco, J Yu, A Laio, M Baroni arXiv preprint arXiv:2405.15471, 2024 | 10 | 2024 |
DRLC: Reinforcement learning with dense rewards from LLM critic M Cao, L Shu, L Yu, Y Zhu, N Wichers, Y Liu, L Meng arXiv e-prints, arXiv:2401.07382, 2024 | 9 | 2024 |
Predicting emergent linguistic compositions through time: Syntactic frame extension via multimodal chaining L Yu, Y Xu arXiv preprint arXiv:2109.04652, 2021 | 9 | 2021 |
RoleCraft-GLM: Advancing personalized role-playing in large language models M Tao, X Liang, T Shi, L Yu, Y Xie arXiv preprint arXiv:2401.09432, 2023 | 8 | 2023 |
Enhancing reinforcement learning with dense rewards from language model critic M Cao, L Shu, L Yu, Y Zhu, N Wichers, Y Liu, L Meng Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 5 | 2024 |
Robust LLM safeguarding via refusal feature adversarial training L Yu, V Do, K Hambardzumyan, N Cancedda arXiv preprint arXiv:2409.20089, 2024 | 5 | 2024 |
Mechanistic understanding and mitigation of language model non-factual hallucinations L Yu, M Cao, JCK Cheung, Y Dong arXiv preprint arXiv:2403.18167, 2024 | 5 | 2024 |
Mechanisms of non-factual hallucinations in language models L Yu, M Cao, JCK Cheung, Y Dong arXiv e-prints, arXiv:2403.18167, 2024 | 5 | 2024 |
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation M Cao, L Shu, L Yu, Y Zhu, N Wichers, Y Liu, L Meng arXiv preprint arXiv:2401.07382, 2024 | 4 | 2024 |
Word sense extension L Yu, Y Xu arXiv preprint arXiv:2306.05609, 2023 | 4 | 2023 |
Inferring symmetry in natural language C Tanchip, L Yu, A Xu, Y Xu arXiv preprint arXiv:2010.08090, 2020 | 4 | 2020 |
How nouns surface as verbs: Inference and generation in word class conversion L Yu, LE Sanyoura, Y Xu Proceedings of the Annual Meeting of the Cognitive Science Society 42, 2020 | 3 | 2020 |
Functional faithfulness in the wild: Circuit discovery with differentiable computation graph pruning L Yu, J Niu, Z Zhu, G Penn arXiv preprint arXiv:2407.03779, 2024 | 2 | 2024 |
Systematic word meta-sense extension L Yu arXiv preprint arXiv:2311.13029, 2023 | 2 | 2023 |
Noun2Verb: Probabilistic frame semantics for word class conversion L Yu, Y Xu Computational Linguistics 48 (4), 783-818, 2022 | 1 | 2022 |
Infinite mixture chaining: Efficient temporal construction of word meaning L Yu, Y Xu Proceedings of the Annual Meeting of the Cognitive Science Society 44 (44), 2022 | 1 | 2022 |
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations Z Ji, L Yu, Y Koishekenov, Y Bang, A Hartshorn, A Schelten, C Zhang, ... arXiv preprint arXiv:2503.14477, 2025 | | 2025 |
Infinite Mixture Chaining: An Efficiency-Based Framework for the Dynamic Construction of Word Meaning L Yu, Y Xu Open Mind 9, 1-24, 2025 | | 2025 |