The necessity of machine learning theory in…

Misha Belkin

Jul 29, 2023

Mitigating AI risk has become a topic of intense effort in recent years and months.


9 Comments

Amartya

Jul 30, 2023

Indeed a great motivation for working towards theory :)


Wayne

Aug 28, 2023

Possibly the most well-written and rigorous piece on safety I've read so far; it may or may not have changed my view on safety research as a whole! We could indeed really use more theory to cut through ambiguities and connect speculation with empirical results.


Shuai Zhao

Nov 7, 2023

It's necessary, but very difficult.



Misha Belkin

Nov 7, 2023

I am hopeful it might be easier than it seems.


Dinesh

Jan 9, 2024

A great piece about security. Can we get in contact to discuss further?


Ivan

Feb 14, 2024

> Theory in this context refers to identifying precise measurable quantities and mathematically describing their patterns, the way it is used in physics and engineering, rather than proving rigorous theorems.

There are many instances where internal developments in pure math later became very useful in physics and engineering. Maybe there are already existing pieces of pure math knowledge that could be useful for mathematical descriptions of LLMs? To possibly detect them, interdisciplinary interaction between AI experts and interested pure mathematicians is required.



Misha Belkin

Feb 14, 2024

I am very much in favor of bringing in mathematicians, applied or pure. Still, we need to treat these phenomena as physics -- mathematical theories need to have explanatory/predictive power to be of use.



Ivan

Feb 15, 2024

There is a need for mathematical theories, or a mathematical vision, that provide a structured, concentrated understanding and prediction of the central issues of LLMs. Of course, given the fast pace of LLM development, producing fully rigorous mathematical proofs will have to wait for some decades and hundreds of PhD math theses. What would be 7-10 LLM papers, talks, or posts for interested mathematicians to look at in order to get quickly to the heart of the matter?


Shabari S Nair

Mar 24 (edited)

I thoroughly enjoyed this post and its compelling arguments for taking a 'theoretical' approach to modern ML. The pace at which LLMs came to be so widely adopted in recent years is indeed quite scary, with most organizations having no issues with their completely 'black box' nature. It would be great to have a set of guiding principles for understanding or developing new deep-learning architectures, especially since new ground-breaking architectures are being devised every other day, and it's very unclear which model works best for which use case. Perhaps this will also bring some much-needed grounding to a field that is progressing at an unprecedented pace.


Data, Machine Learning and AI
