September 2024 Newsletter

MIRI updates: Aaron Scher and Joe Collman have joined the Technical Governance Team at MIRI as researchers. Aaron previously did independent research related to sycophancy in language models and mechanistic interpretability, while Joe previously did independent research related to AI safety via debate and contributed to field-building work at MATS...

Tue Sep 17, 2024 00:25
July 2024 Newsletter

Update (7-15-2024): this newsletter originally included Eliezer’s interview with Bloomberg’s Nate Lanxon and Jackie Davalos, which I mistakenly thought was released recently. That interview was actually released last year. MIRI updates: Rob Bensinger suggests that AI risk discourse could be improved by adopting a new set of labels for different...

Thu Jul 11, 2024 02:03
June 2024 Newsletter

MIRI updates: MIRI Communications Manager Gretta Duleba explains MIRI’s current communications strategy. We hope to clearly communicate to policymakers and the general public why there’s an urgent need to shut down frontier AI development, and make the case for installing an “off-switch”. This will not be easy, and there is a lot of work...

Sat Jun 15, 2024 05:42
MIRI 2024 Communications Strategy

As we explained in our MIRI 2024 Mission and Strategy update, MIRI has pivoted to prioritize policy, communications, and technical governance research over technical alignment research. This follow-up post goes into detail about our communications strategy. The Objective: Shut it Down. Our objective is to convince major powers to shut down the development...

Wed May 29, 2024 23:26
May 2024 Newsletter

Update (5-15-2024): I wrote that “it appears that not all of the leading AI labs are honoring the voluntary agreements they made at [AI Safety Summit],” citing a Politico article. However, after seeing more discussion about it (e.g. here), I am now highly uncertain about whether the labs made specific commitments, what those commitments were, and...

Wed May 15, 2024 04:07
April 2024 Newsletter

The MIRI Newsletter is back in action after a hiatus since July 2022. To recap some of the biggest MIRI developments since then: MIRI released its 2024 Mission and Strategy Update, announcing a major shift in focus: While we’re continuing to support various technical research programs at MIRI, our new top priority is broad public communication...

Sat Apr 13, 2024 05:42
