This consistent trend of models getting smaller, faster and more capable on the same hardware is one of the reasons I'm not particularly concerned by the ongoing discourse about models hitting a plateau
https://simonwillison.net/2024/Dec/9/llama-33-70b/#is-performance-about-to-plateau-
Simon Willison (simon@fedi.simonwillison.net)'s status on Tuesday, 10-Dec-2024 04:53:10 JST
Simon Willison (simon@fedi.simonwillison.net)'s status on Tuesday, 10-Dec-2024 04:53:13 JST
I can now run a GPT-4 class model on my laptop
(The exact same laptop that could just about run a GPT-3 class model 20 months ago)
The new Llama 3.3 70B is a striking example of the huge efficiency gains we've seen in the last two years
https://simonwillison.net/2024/Dec/9/llama-33-70b/
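The "70B on a laptop" claim is easy to sanity-check with back-of-envelope arithmetic: at roughly 4–5 bits per weight (typical for Q4-style quantization, including overhead), a 70-billion-parameter model's weights fit comfortably in 64 GB of RAM. A minimal sketch, with the bits-per-weight figure an assumption rather than anything stated in the post:

```python
# Back-of-envelope RAM estimate for running a quantized LLM locally.
# Assumption (not from the post): ~4.5 effective bits per weight for a
# Q4-style quantization, ignoring KV cache and runtime overhead.

def model_ram_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate RAM needed for the weights alone, in GB."""
    bytes_total = n_params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9

weights_gb = model_ram_gb(70, 4.5)  # Llama 3.3 70B at a Q4-ish quantization
print(f"~{weights_gb:.0f} GB for weights")  # ≈ 39 GB, within a 64 GB laptop
```

The same arithmetic explains why an unquantized 70B model (16 bits per weight, ~140 GB) was out of reach on the same machine.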