• Adalast@lemmy.world
    13 hours ago

    Honestly, auto-generating text descriptions for visually impaired people is probably one of the few potential good uses for LLM + CLIP. Being able to have a brief but accurate description without relying on some jackass to have written it is a bonefied good thing. It isn’t even eliminating anyone’s job, since the jackass doesn’t always do it in the first place.

    • SGforce@lemmy.ca
      54 minutes ago

      The models that do that now are very capable but aren’t tuned properly IMO. They are overly flowery and sickly positive even when describing something plain. Prompting them to be more succinct only has them cut themselves off and leave out important things. But I can totally see that improving soon.

    • AWildMimicAppears@lemmy.dbzer0.com
      4 hours ago

      I am so sorry, and I agree with your point, but I really had a good laugh at my mental image of a bonefied good thing :-)

      If you already know or it’s autocorrect, just ignore me; if not, it’s bona fide :-)