The malicious use of artificial intelligence: Forecasting, prevention, and mitigation M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A Dafoe, ... arXiv preprint arXiv:1802.07228, 2018 | 1169 | 2018 |
Model evaluation for extreme risks T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... arXiv preprint arXiv:2305.15324, 2023 | 133 | 2023 |
How does the offense-defense balance scale? B Garfinkel, A Dafoe Emerging Technologies and International Stability, 247-274, 2021 | 91 | 2021 |
Democratising AI: Multiple meanings, goals, and methods E Seger, A Ovadya, D Siddarth, B Garfinkel, A Dafoe Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 715-722, 2023 | 63 | 2023 |
Towards best practices in AGI safety and governance: A survey of expert opinion J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ... arXiv preprint arXiv:2305.07153, 2023 | 43 | 2023 |
The windfall clause: Distributing the benefits of AI for the common good C O'Keefe, P Cihon, B Garfinkel, C Flynn, J Leung, A Dafoe Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 327-331, 2020 | 39 | 2020 |
Open-sourcing highly capable foundation models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ... arXiv preprint arXiv:2311.09227, 2023 | 36 | 2023 |
Beyond privacy trade-offs with structured transparency A Trask, E Bluemke, T Collins, BGE Drexler, CG Cuervas-Mons, I Gabriel, ... arXiv preprint arXiv:2012.08347, 2020 | 30 | 2020 |
& Amodei, D.(2018) M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A DAFOE, ... The malicious use of artificial intelligence: Forecasting, prevention, and …, 1802 | 20 | 1802 |
The malicious use of artificial intelligence: forecasting, prevention, and mitigation. Future of Humanity Institute, University of Oxford, Centre for the Study of Existential … M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, D Amodei Center for a New American Security, Electronic Frontier Foundation, OpenAI 1 …, 2018 | 19 | 2018 |
The windfall clause: Distributing the benefits of AI, Centre for the governance of AI research report C O’Keefe, P Cihon, C Flynn, B Garfinkel, J Leung, A Dafoe Future of Humanity Institute, University of Oxford. https://www. fhi. ox. ac …, 2020 | 14 | 2020 |
Open problems in technical ai governance A Reuel, B Bucknall, S Casper, T Fist, L Soder, O Aarne, L Hammond, ... arXiv preprint arXiv:2407.14981, 2024 | 13 | 2024 |
AI policy levers: A review of the US government’s tools to shape AI research, development, and deployment SC Fischer, J Leung, M Anderljung, C O’keefe, S Torges, SM Khan, ... Retrieved June 1, 2022, 2021 | 9 | 2021 |
From principles to rules: A regulatory approach for frontier AI J Schuett, M Anderljung, A Carlier, L Koessler, B Garfinkel arXiv preprint arXiv:2407.07300, 2024 | 8 | 2024 |
Exploring the Relevance of Data Privacy-Enhancing Technologies for AI Governance Use Cases E Bluemke, T Collins, B Garfinkel, A Trask arXiv preprint arXiv:2303.08956, 2023 | 8 | 2023 |
Contact tracing apps can help stop coronavirus. But they can hurt privacy T Shevlane, B Garfinkel, A Dafoe The Washington Post, 2020 | 7 | 2020 |
On the impossibility of supersized machines B Garfinkel, M Brundage, D Filan, C Flynn, J Luketina, M Page, ... arXiv preprint arXiv:1703.10987, 2017 | 6 | 2017 |
The impact of artificial intelligence B Garfinkel The Oxford handbook of AI governance, 2022 | 5 | 2022 |
Towards best practices in AGI safety and governance J Schuett, N Dreksler, M Anderljung, D McCaffary, L Heim, E Bluemke, ... Surv. Expert Opin., 2023 | 4 | 2023 |
Open-sourcing highly capable foundation models E Seger, N Dreksler, R Moulange, E Dardaman, J Schuett, K Wei, ... Research paper, Centre for the Governance of AI, 2023 | 3 | 2023 |