    Lukas Galke (lpag@sigmoid.social)'s status on Sunday, 28-May-2023 15:59:51 JST
    in reply to
    • Prof. Emily M. Bender (she/her)
    • Andrew Lampinen

    @emilymbender I wonder whether, in general, it is fair to conclude that, just because we cannot imagine how we could do something, some other learning system (such as an LLM) cannot do it either?

    On a related note: I would love to hear your take on @lampinen et al.'s recent work: https://arxiv.org/abs/2305.16183

    In conversation Sunday, 28-May-2023 15:59:51 JST from sigmoid.social

    Attachments

    1. Passive learning of active causal strategies in agents and language models
      What can be learned about causality and experimentation from passive data? This question is salient given recent successes of passively-trained language models in interactive domains such as tool use. Passive learning is inherently limited. However, we show that purely passive learning can in fact allow an agent to learn generalizable strategies for determining and using causal structures, as long as the agent can intervene at test time. We formally illustrate that learning a strategy of first experimenting, then seeking goals, can allow generalization from passive learning in principle. We then show empirically that agents trained via imitation on expert data can indeed generalize at test time to infer and use causal links which are never present in the training data; these agents can also generalize experimentation strategies to novel variable sets never observed in training. We then show that strategies for causal intervention and exploitation can be generalized from passive data even in a more complex environment with high-dimensional observations, with the support of natural language explanations. Explanations can even allow passive learners to generalize out-of-distribution from perfectly-confounded training data. Finally, we show that language models, trained only on passive next-word prediction, can generalize causal intervention strategies from a few-shot prompt containing examples of experimentation, together with explanations and reasoning. These results highlight the surprising power of passive learning of active causal strategies, and may help to understand the behaviors and capabilities of language models.
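    The final result summarized in the abstract, that a language model trained only on passive next-word prediction can generalize a causal intervention strategy from a few-shot prompt containing examples of experimentation together with explanations and reasoning, can be illustrated with a small prompt-construction sketch in Python. The episode format, the variable names, and the `lm.complete` placeholder below are illustrative assumptions and are not taken from the paper's actual prompts, environments, or code.

    # Minimal sketch (hypothetical): build a few-shot prompt in which each example
    # shows an agent first experimenting (intervening on variables), then explaining
    # which intervention produced reward, and finally acting on that inference.
    from textwrap import dedent

    # Each few-shot example pairs observed interventions with an explanation of the
    # inferred causal link, mirroring the "experimentation + explanation" recipe.
    EXAMPLES = [
        dedent("""\
            Episode:
              intervene(A=1) -> reward 0
              intervene(B=1) -> reward 1
            Explanation: setting B=1 causes the reward, so exploit B.
            Action: intervene(B=1)"""),
        dedent("""\
            Episode:
              intervene(X=1) -> reward 1
              intervene(Y=1) -> reward 0
            Explanation: setting X=1 causes the reward, so exploit X.
            Action: intervene(X=1)"""),
    ]

    # A new episode with previously unseen variables; the model is asked to continue
    # the pattern: explain the inferred causal link, then choose an action.
    QUERY = dedent("""\
        Episode:
          intervene(Q=1) -> reward 0
          intervene(R=1) -> reward 1
        Explanation:""")

    def build_prompt(examples, query):
        """Join the few-shot examples and the new episode into one plain-text prompt."""
        return "\n\n".join(list(examples) + [query])

    if __name__ == "__main__":
        prompt = build_prompt(EXAMPLES, QUERY)
        print(prompt)
        # A passively trained language model would then be asked to complete this prompt,
        # e.g. completion = lm.complete(prompt)  # `lm` is a placeholder, not a real client/API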