Follow
Iason Gabriel
Iason Gabriel
Senior Staff Research Scientist, Google DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
10812021
Ethical and social risks of harm from language models
L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ...
arXiv preprint arXiv:2112.04359, 2021
9702021
Artificial intelligence, values, and alignment
I Gabriel
Minds and machines 30 (3), 411-437, 2020
7222020
Taxonomy of risks posed by language models
L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ...
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022
5562022
Improving alignment of dialogue agents via targeted human judgements
A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ...
arXiv preprint arXiv:2209.14375, 2022
4642022
Power to the people? Opportunities and challenges for participatory AI
A Birhane, W Isaac, V Prabhakaran, M Diaz, MC Elish, I Gabriel, ...
Proceedings of the 2nd ACM Conference on Equity and Access in Algorithms …, 2022
2262022
Alignment of language agents
Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving
arXiv preprint arXiv:2103.14659, 2021
1602021
Effective altruism and its critics
I Gabriel
Journal of Applied Philosophy 34 (4), 457-473, 2017
1592017
Model evaluation for extreme risks
T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ...
arXiv preprint arXiv:2305.15324, 2023
1382023
In conversation with artificial intelligence: aligning language models with human values
A Kasirzadeh, I Gabriel
Philosophy & Technology 36 (2), 27, 2023
1202023
Sociotechnical safety evaluation of generative ai systems
L Weidinger, M Rauh, N Marchal, A Manzini, LA Hendricks, ...
arXiv preprint arXiv:2310.11986, 2023
1082023
Toward a theory of justice for artificial intelligence
I Gabriel
Daedalus 151 (2), 218-231, 2022
792022
The Challenge of Value Alignment
I Gabriel, V Ghazavi
The Oxford Handbook of Digital Ethics, 2022
56*2022
A human rights-based approach to responsible AI
V Prabhakaran, M Mitchell, T Gebru, I Gabriel
arXiv preprint arXiv:2210.02667, 2022
512022
Characteristics of harmful text: Towards rigorous benchmarking of language models
M Rauh, J Mellor, J Uesato, PS Huang, J Welbl, L Weidinger, S Dathathri, ...
Advances in Neural Information Processing Systems 35, 24720-24739, 2022
482022
The ethics of advanced ai assistants
I Gabriel, A Manzini, G Keeling, LA Hendricks, V Rieser, H Iqbal, ...
arXiv preprint arXiv:2404.16244, 2024
342024
Using the Veil of Ignorance to align AI systems with principles of justice
L Weidinger, KR McKee, R Everett, S Huang, TO Zhu, MJ Chadwick, ...
Proceedings of the National Academy of Sciences 120 (18), e2213709120, 2023
332023
Beyond privacy trade-offs with structured transparency
A Trask, E Bluemke, T Collins, BGE Drexler, CG Cuervas-Mons, I Gabriel, ...
arXiv preprint arXiv:2012.08347, 2020
312020
Permissible secrets
H Lazenby, I Gabriel
The Philosophical Quarterly 68 (271), 265-285, 2018
222018
STELA: a community-centred approach to norm elicitation for AI alignment
S Bergman, N Marchal, J Mellor, S Mohamed, I Gabriel, W Isaac
Scientific Reports 14 (1), 6616, 2024
182024
The system can't perform the operation now. Try again later.
Articles 1–20