• Glitchvid@lemmy.world
      link
      fedilink
      English
      arrow-up
      24
      ·
      17 hours ago

      The amount of stupid AI scraping behavior I see even on my small websites is ridiculous, they’ll endlessly pound identical pages as fast as possible over an entire week, apparently not even checking if the contents changed. Probably some vibe coded shit that barely functions.

    • cm0002@lemmy.world
      link
      fedilink
      English
      arrow-up
      103
      ·
      24 hours ago

      Right‽ This is ridiculously stupid when you can download the entirety of Wikipedia in a single package and parse it to your hearts desire

      • TheTechnician27@lemmy.world
        link
        fedilink
        English
        arrow-up
        72
        ·
        edit-2
        23 hours ago

        Not only that, but we make it goddamn trivial for not just Wikipedia but for other Wikimedia projects. Doing this is just stealing without attribution and share-alike like the CC BY-SA 4.0 license demands and then on top of that kicking down the ladder for people who actually want to use Wikimedia and not the hallucinatory slop they’re trying to supplant it with. LLM companies have caused incalculable damage to critical thinking, the open web, the copyleft movement, and the climate.