• Adalast@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      ·
      13 hours ago

      Honestly, auto generating text descriptions for visually impaired people is probably one of the few potential good uses for LLM + CLIP. Being able to have a brief but accurate description without relying on some jackass to have written it is a bonefied good thing. It isn’t even eliminating anyone’s job since the jackass doesn’t always do it in the first place.

      • SGforce@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        ·
        49 minutes ago

        The models that do that now are very capable but aren’t tuned properly IMO. They are overly flowery and sickly positive even when describing something plain. Prompting them to be more succinct only has them cut themselves off and leave out important things. But I can totally see that improving soon.

      • AWildMimicAppears@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        1
        ·
        4 hours ago

        I am so sorry, and i agree with your point, but i really had a good laugh at my mental image of a bonefied good thing :-)

        If you know already or it’s autocorrect, just ignore me, if not, it’s bona fide :-)