
Conversation

That DΔrn Pooka :verified_think: (theququ@shitposter.world)'s status on Friday, 24-May-2024 06:13:51 JST

I have not read the LLaVA paper, but I am assuming it doesn't actually give the text model any spatial information. It gets a lot of the elements in the picture right but can't determine their positions. (Llama-3-8B is the model used.)
In conversation Friday, 24-May-2024 06:13:51 JST from shitposter.world
Attachments
1. Untitled attachment
  https://media.shitposter.world/shitposter.club/4dbb59fac5b36a6e92fa75aa6e99a9cbe0ac7a2cebffd57f3a548f71a17be87b.png?name=D1v-Z32QdTIEtA.png
