https://societyofauthors.org/2025/04/01/soa-day-of-action-following-allegations-of-metas-mass-theft-of-authors-work/

The SoA is organising a day of protest against Meta following revelations of pirated books being used to train their large language models

On Thursday 20 March, The Atlantic broke the story of how Meta has used the Library Genesis (LIbGen) dataset, which is full of pirated material, to develop their AI systems.

The revelations detailed by The Atlantic come against the background of the recent government consultation into Artificial Intelligence (AI) and copyright and the #MakeItFair campaign which sees the UK creative industries fighting back against the proposed changes to copyright law, which would favour multinational tech companies, but irremediably damage the creative industries.

  • I Cast Fist@programming.dev
    link
    fedilink
    English
    arrow-up
    31
    ·
    1 day ago

    The right thing for meta to do is to implode and cause a complete crash on the stock exchange as it is erased from history.

  • Pirata@lemm.ee
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    2
    ·
    1 day ago

    And some people still complain that US tech companies keep getting petty fines in Europe.

    We’re the only ones trying to enforce even an anemic degree of antitrust, since the US just allows them to do whatever.

      • General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        19 hours ago

        I don’t see how this fair use case is different from those in the past. There’s a tech company defending. Organizations like the EFF or the Internet Archive issue supporting statements.

        I don’t see the hypocrisy. The content industry is suing tech companies now just like they have in the past, and just like they sue individuals now and in the past.

        If I had to guess at the cause of the difference, I’d say that there is a lot of money being spent on social media PR. But perhaps it also is a result of the right-ward shift of society. I wonder how much that has to do with propaganda by the content industry.

        • Lka1988@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          17 hours ago

          The hypocrisy is that if an average person did something like this (such as, IDK, Aaron Schwartz) they would have already been swiftly prosecuted and made out to be an example of “look what will happen to you”.

          It’s just the two-tiered justice system being what it is, again.

          • General_Effort@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            8 hours ago

            Yeah, that’s another one of the deliberately deceptive talking points being spread.

            First of all, average people did this. The dataset Books3 was created by a jobless individual named Shawn Presser using one of Aaron’s scripts. Later he shared it with Meta. What makes the difference for Shawn is that the legal department of Meta stands between him and the copyright industry. As far as I can tell, Shawn is way more average than Aaron in that he doesn’t rub shoulders with the likes of Sam Altman.

            It’s interesting how this talking point works. Someone shills for the copyright industry against the interests of the average person. And the justification is that the copyright industry persecuted Aaron Swartz. That doesn’t make sense, does it?

  • silverhand@reddthat.com
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    6
    ·
    1 day ago

    A small British trade union vs a 1.5 Trillion dollar American corporation?

    LOL, good luck

  • chamaeleon@fedia.io
    link
    fedilink
    arrow-up
    2
    ·
    1 day ago

    I was getting confused, not understanding why Structure of Arrays needed a day off action. Except perhaps to point out the benefits of locality of the same type of data for parallel processing, etc.