• MacN'Cheezus@lemmy.todayOP
      link
      fedilink
      English
      arrow-up
      33
      arrow-down
      1
      ·
      edit-2
      6 months ago

      To be fair, it did spit out a couple of completely nonsensical images afterwards:

      I think the AI might be biased against dogs.

      • lseif@sopuli.xyz
        link
        fedilink
        arrow-up
        8
        ·
        6 months ago

        did it give you the images in base64 from an llm, or from an image generation model ?

        • ChaoticNeutralCzech@lemmy.ml
          link
          fedilink
          arrow-up
          7
          ·
          edit-2
          6 months ago

          I think you can guess that part. I doubt a current LLM can create a valid PNG, even if it’s just a 1x1px one that has been created before. This is partially because PNGs have a checksum and the LLM has definitely not seen enough PNGs in base64 to figure out the algorithm, and is not optimized to calculate checksums. In fact, I analyzed the image and the image header checksum is wrong even though the header makes sense (was likely stolen). Also, it gets penalized for repetition, which occurs a lot in image headers.

          AFAIK, the smallest valid image you see mentioned on the web is a 35-byte transparent pixel GIF, and the smallest PNG is a black pixel with 67 bytes:

          data:image/gif;base64,R0lGODlhAQABAAAAACH5BAEAAAAALAAAAAABAAEAAAIBAAA=
          data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABAQAAAAA3bvkkAAAACklEQVR4AWNgAAAAAgABc3UBGAAAAABJRU5ErkJggg==
          

          Testing rendering: Alt text for the GIF; if you see it, it failed, Alt text for the PNG; if you see it, it failed, another 67-byte PNG but 8 px wide: , or 1 gray pixel: , or a green one:

          The article + the generator