9 Comments
User's avatar
Amartya's avatar

Indeed a great motivation for working towards theory :)

Expand full comment
Wayne's avatar

Possible the most well-written and rigorous piece on safety I've read so far, may or may not changed my view on safety research as a whole! We indeed could really use more theories to cut through ambiguities and connect speculations with the empiricals.

Expand full comment
Shuai Zhao's avatar

Although it's necessary, but very difficult.

Expand full comment
Misha Belkin's avatar

I am hopeful it might be easier than it seems.

Expand full comment
Dinesh's avatar

A great piece about security, Can we get in contact to discuss further?

Expand full comment
Ivan's avatar

> Theory in this context refers to identifying precise measurable quantities and mathematically describing their patterns, the way it is used in physics and engineering, rather than proving rigorous theorems. ~ there are many instances when internal developments in pure math later became very useful in physics and engineering. Maybe, there are already existing pieces of pure math knowledge which could be useful for mathematical descriptions of LLMs? To possible detect them, an interdiscplinary interaction between AI experts and interested pure mathematicians is required.

Expand full comment
Misha Belkin's avatar

I am very much in favor of bringing in mathematicians, applied or pure. Still, we need to treat these phenomena as physics -- mathematical theories need to have explanatory/predictive power to be of use.

Expand full comment
Ivan's avatar

There is a need for math theories or math vision which provide a kind of structured concentrated understanding and prediction of the central issues of LLM. Of course, in view of the fast pace of LLM developments, producing full rigorous math proofs will have to wait for some decades and hundreds of PhD math theses. What would be 7-10 LLM papers or talks/posts for interested mathematicians to have a look at in order to go fast into the heart of the matter?

Expand full comment
Shabari S Nair's avatar

I thoroughly enjoyed this post and its compelling arguments for taking a 'theoretical' approach to modern ML. It is indeed quite scary the pace with which LLMs came to be so widely adopted in recent years, with most organizations having no issues with its completely 'black box' nature. It would be great to have a set of guiding principles for understanding or developing new deep-learning architectures, especially since new ground-breaking architectures are being devised every other day, and its very unclear which model works best for which use case. Perhaps this will also bring some much-needed grounding to a field that is progressing at an unprecedented pace.

Expand full comment