”Even the strongest critics — e.g., Eliezer Yudkowsky and colleagues at MIRI

Even the strongest critics — e.g., Eliezer Yudkowsky and colleagues at MIRI — call for more work in mechanistic interpretability, corrigibility and explainability. The issue is the gap: an order of magnitude more resources go to capabilities than to safety.

2025-08-22 Interview with Andreas Hepp

顯示前後文Show context

鍵盤快捷鍵Keyboard shortcuts