@feditips That’s from scratch text-to-image generation, not captioning.
Eg, VoiceOver can run on iOS and basically describes what you see in your camera app in real time. Were it using as much energy as it takes to charge your phone, I imagine you’d notice.
I don’t necessarily know what tools what software is using to caption images, but most people can run some form of captioning locally without the CPU breaking a sweat.
GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.
All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.