Conversation

Notices

Victoria Stuart 🇨🇦 🏳️‍⚧️ (persagen@mastodon.social)'s status on Wednesday, 13-Sep-2023 15:58:15 JST

Addendum 8
Instruction tuning: https://en.wikipedia.org/wiki/Large_language_model#Instruction_tuning
* self-instruct approaches
* enable LLM to bootstrap correct responses
Instruction Tuning for Large Language Models: A Survey
https://arxiv.org/abs/2308.10792
* instruction tuning (IT): further supervised training of LLMs on a dataset of (instruction, output) pairs
* enhances capabilities, controllability of LLMs
* bridges gap between next-word prediction objective of LLMs & users' objective of having LLMs adhere to human instructions
#LLM #LargeLanguageModels #InstructionTuning
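The (instruction, output) pair format noted above can be illustrated with a minimal sketch. It is not taken from the survey: the prompt template, the whitespace tokenizer, and the helper name `build_example` are all hypothetical stand-ins. It also assumes one common convention, that only output tokens contribute to the training loss, which the post itself does not state.

```python
from typing import List, Tuple

# Hypothetical prompt template (an assumption); real datasets define their own.
TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"


def toy_tokenize(text: str) -> List[str]:
    """Whitespace tokenizer standing in for a real subword tokenizer."""
    return text.split()


def build_example(instruction: str, output: str) -> Tuple[List[str], List[int]]:
    """Turn one (instruction, output) pair into (tokens, loss_mask).

    loss_mask[i] is 1 for output tokens (supervised) and 0 for prompt
    tokens (assumed here to be excluded from the loss).
    """
    prompt_tokens = toy_tokenize(TEMPLATE.format(instruction=instruction))
    output_tokens = toy_tokenize(output) + ["<eos>"]
    tokens = prompt_tokens + output_tokens
    loss_mask = [0] * len(prompt_tokens) + [1] * len(output_tokens)
    return tokens, loss_mask


# Illustrative pairs, invented for this sketch.
pairs = [
    ("Translate to French: cat", "chat"),
    ("Add 2 and 3", "5"),
]

for instruction, output in pairs:
    tokens, mask = build_example(instruction, output)
    supervised = [t for t, m in zip(tokens, mask) if m == 1]
    print(supervised)
```

In an actual fine-tuning run the mask would typically be applied to the per-token next-word prediction loss, so the model is still trained with its pretraining objective but only on the response portion of each sequence.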
Attachments
1. Instruction Tuning for Large Language Models: A Survey Figure 1: General pipeline of instruction tuning. https://arxiv.org/abs/2308.10792
  https://files.mastodon.social/media_attachments/files/110/945/427/987/026/317/original/32e7b1bd2b33626c.png
2. Large language model
  A large language model (LLM) is a language model characterized by its large size. Its size is enabled by AI accelerators, which are able to process vast amounts of text data, mostly scraped from the Internet. The artificial neural networks which are built can contain from tens of millions up to billions of weights and are (pre-)trained using self-supervised learning and semi-supervised learning. Transformer architecture contributed to faster training. Alternative architectures include the mixture of experts (MoE), which has been proposed by Google, starting with sparsely-gated ones in 2017, GShard in 2021 to GLaM in 2022. As language models, they work by taking an input text and repeatedly predicting the next token or word. Up to 2020, fine-tuning was the only way a model could be adapted to accomplish specific tasks. Larger models, such as GPT-3, however, can be prompt-engineered to achieve similar results. They are thought to acquire embodied knowledge about syntax, semantics and "ontology" inherent in human language corpora, but also inaccuracies and biases...
