• catloaf@lemm.ee
    link
    fedilink
    English
    arrow-up
    23
    arrow-down
    2
    ·
    2 days ago

    An HTTP request is a request. Servers are free to rate limit or deny access

    • taladar@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      14
      ·
      2 days ago

      Rate limiting in itself requires resources that are not always available. For one thing you can only rate limit individuals you can identify so you need to keep data about past requests in memory and attach counters to them and even then that won’t help if the requests come from IPs that are easily changed.

    • FaceDeer@fedia.io
      link
      fedilink
      arrow-up
      19
      ·
      2 days ago

      And Wikimedia, in particular, is all about publishing data under open licenses. They want the data to be downloaded and used by others. That’s what it’s for.

      • LostXOR@fedia.io
        link
        fedilink
        arrow-up
        6
        ·
        1 day ago

        Even so I think it would be totally reasonable for them to block web scrapers, as they provide better ways to download all their data.

        • FaceDeer@fedia.io
          link
          fedilink
          arrow-up
          9
          ·
          1 day ago

          At the root of this comment chain is a proposal to have laws passed about this.

          People can set up their web servers however they like. It’s on them to do that, it’s their web servers. I don’t think there should be legislation about whether you’re allowed to issue perfectly ordinary HTTP requests to a public server, let the server decide how to respond to them.