• go $fsck yourself@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    edit-2
    4 months ago

    I always thought it was scummy as fuck that WordPress.org, a 501c3 nonprofit, is allowed to funnel business to WordPress.com which is a completely separate for-profit entity.

    They are even allowed to trick people into thinking they are the same by using the name and trademarks, which they explicitly state you cannot do. But wp.com gets a free pass for some reason? Scummy as fuck.

    • IllNess@infosec.pub
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 months ago

      All these AI and machine learning companies are taking content directly from websites and ignoring robot.txt files.

      If your content is able to be crawled, even without being listed on search engines, I don’t think it really matters.

      • T156@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        It might help proof an AI company against legal issues that might be brought about by their using the content. If they’re ever sued by Automattic, then they can just point to the deal and say that they bought the data from them. There’s much less ambiguity.

        • IllNess@infosec.pub
          link
          fedilink
          English
          arrow-up
          1
          ·
          4 months ago

          You are correct, about the legal stuff. These companies are being sued all the time.

          Doing this deal also makes processing the data a lot easier. Being handed a big ass database would be a lot easier than crawling for content.

          What I posted was about how they operate. These companies showed time and time again that they don’t really care what data they are taking or from whom. They will even take their own AI or machine learning content and put it in their own system.

  • RizzRustbolt@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    Matt’s selling it.

    The teams at Wordpress and Tumblr have made it known that they absolutely don’t want this shit.

  • herrcaptain@lemmy.ca
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    I’m assuming this just relates to WordPress.com rather than the open-source WordPress.org but it’s still a bummer. I’ve worked with the open source platform for over a dozen years and have started to kinda loathe what it’s turned into but I’m not sure I’m yet at the point where I’m ready to migrate a bunch of sites to something else. This could be that push if they keep going down this road.

    God, am I getting too old for this shit? I’m a pretty technical person but this AI nonsense is just relentless. I’m not philosophically against the idea of AI as like any tool it has the potential to better the world, but every tech company and their dog are going all in on using it for commercial bullshit that seems to provide very little value to society. Even fucking Mozilla is going in that direction.

    • Traister101@lemmy.today
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      4 months ago

      It’s the new NFTs and Crypto but it’s not blatantly a scam so the companies that skipped out on those sure as shit will be hoping onto AI

  • FrostKing@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 months ago

    Can someone please outline the main reasons people are upset with these sites for choosing to do this?

    • Ultraviolet@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      4 months ago

      There are 3 very important things that have to be respected when using someone’s work. Consent, credit, and compensation. The data is being taken without the consent of users, they’re not being credited for anything, and they don’t receive so much as a cent in exchange.

  • donuts@kbin.social
    link
    fedilink
    arrow-up
    1
    ·
    4 months ago

    Funny how all of these social media platforms that were so happy to describe themselves as “the public town square of the internet” or whatever are now claiming that they own everything that everyone ever posted. So, which is it? Because it obviously cannot be both.

  • Nikelui@piefed.social
    link
    fedilink
    arrow-up
    0
    ·
    4 months ago

    I wonder if there is a text equivalent of Glaze and Nightshade, to perform adversarial attacks on AI scraping the text.

  • LunaCtld@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    I welcome this change actually. Now users can clearly see what others have been saying forever: If you don’t pay for the product, you ARE the product.

  • Please_Do_Not@lemm.ee
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 months ago

    I work in marketing, and every client I work with who has a WordPress website is using AI to write a lot of their content. This is going to lead to circularly trained AI for sure.

      • Please_Do_Not@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Not sure, especially since they compare it to the Squareapace deal which I believe is for all sites built on the platform.

          • Please_Do_Not@lemm.ee
            link
            fedilink
            English
            arrow-up
            0
            ·
            4 months ago

            My misunderstanding. But it looks like you need a .org to self-host WP, and like 99% of WP-built sites are .com as far as I’ve seen. I definitely do not know the technicals about different ways to host/build on the same platform, so I certainly defer to you there, but in any case, my bet is that any site/platform that gets scraped indiscriminately will lead to a lot of circular AI training.

            • harsh3466@lemmy.ml
              link
              fedilink
              English
              arrow-up
              0
              ·
              4 months ago

              There are A LOT of self hosted Wordpress sites out there. Many of them you wouldn’t know unless told they were Wordpress (I believe both The Verge and TechCrunch use self hosted Wordpress). I myself have two self hosted Wordpress sites. Though I’ve been considering moving away from Wordpress for awhile now.

              • Roldyclark@literature.cafe
                link
                fedilink
                English
                arrow-up
                1
                ·
                4 months ago

                Yeah there are def more self hosted than not. Wordpress.org is just the site for the open source project. Most hosting sites come with 1 click WordPress installs. I’ve built so many sites with it.