It looks like the ex-DDG employee got the details wrong, and read the slides backwards.

  • Holyginz@lemmy.world
    link
    fedilink
    arrow-up
    94
    arrow-down
    5
    ·
    1 year ago

    See the cynic in me is wondering whether it was actually a mistake like they said, or if it’s a cover up.

    • plistig@feddit.de
      link
      fedilink
      arrow-up
      85
      arrow-down
      4
      ·
      1 year ago

      “Hey, Wired, weird thing with our search algorithm. It seems to always put you on page 2 since your article dropped. I wonder why that is?” – Larry Page, probably.

    • Skull giver@popplesburger.hilciferous.nlOP
      link
      fedilink
      arrow-up
      36
      arrow-down
      2
      ·
      edit-2
      1 year ago

      My interpretation, based on a comment on orange Reddit, is that they read this slide and interpreted it the wrong way around.

      The claim was that Google was taking a query for “kids clothing” and turning it into “$brandName kids clothing” to get more ad revenue from $brandName, but the slide shows the exact opposite: “$brandName kids clothing” is turned into “kids clothing”. I can’t find much about ads, I’m not sure if the ads were ever affected by this keyword transformation, but if they are then the ads you see will be more generic (and worth less, and less profitable).

            • Skull giver@popplesburger.hilciferous.nlOP
              link
              fedilink
              arrow-up
              4
              ·
              edit-2
              1 year ago

              I assume it has to do with code filtering out attempts to inject HTML / scripts into comments. Lemmy had a bunch of bugs that allowed hackers to inject Javascript so they turned on quite an aggressive filter.

              • mindbleach@sh.itjust.works
                link
                fedilink
                arrow-up
                4
                ·
                1 year ago

                They fucked it up completely in a way that raises questions of competence.

                HTML has ways to display angle brackets specifically intended to never be interpreted as tags. “Entity names” will never be code. There’s not even a sensible way to do it deliberately, like %20 nonsense.

              • 0xD@infosec.pub
                link
                fedilink
                arrow-up
                1
                arrow-down
                2
                ·
                1 year ago

                Could have done it with proper encoding, don’t need to remove it lol o.O

                • MotoAsh@lemmy.world
                  link
                  fedilink
                  arrow-up
                  1
                  arrow-down
                  2
                  ·
                  1 year ago

                  Allowing tainted data in to the dataset means every single client has to do every single spot of content rendering correctly or else be vulnerable to easy hacking. Keeping it out of the dataset means not all clients have to be perfect for Lemmy to be a secure place.

      • xionzui@sh.itjust.works
        link
        fedilink
        arrow-up
        11
        ·
        1 year ago

        The slide shows neither. It shows that they use synonyms to get more results. They take a search for “kids clothing” and add results for “children’s clothing” and “kidswear”

      • stifle867@programming.dev
        link
        fedilink
        arrow-up
        5
        ·
        1 year ago

        My interpretation was this + in terms of the actual “sponsored” results work by matching “kids clothing” with advertisers who match for that term, and Google “changing” it into “$brand_name kids clothing” which seems entirely obvious when spelling it out.

        I haven’t used Google as my primary search engine for many years but occasionally I do run a search on it. While the quality of results is extremely low, I never noticed anything obvious like a generic search term only returning results for a specific brand + that search term like the original article implied.

        It seemed like a giant misunderstanding of how it all works from the start but made for a great headline.