A new tool lets artists add invisible changes to the pixels in their art before they upload it online so that if it’s scraped into an AI training set, it can cause the resulting model to break in chaotic and unpredictable ways.

The tool, called Nightshade, is intended as a way to fight back against AI companies that use artists’ work to train their models without the creator’s permission.
[…]
Zhao’s team also developed Glaze, a tool that allows artists to “mask” their own personal style to prevent it from being scraped by AI companies. It works in a similar way to Nightshade: by changing the pixels of images in subtle ways that are invisible to the human eye but manipulate machine-learning models to interpret the image as something different from what it actually shows.

    • 0xD@infosec.pub
      link
      fedilink
      English
      arrow-up
      38
      arrow-down
      8
      ·
      1 year ago

      I don’t see a problem with it training on all materials, fuck copyright. I see the problem in it infringing on everyone’s copyright and then being proprietary, monetized bullshit.

      If it trains on an open dataset, it must be completely and fully open. Everything else is peak capitalism.

      • Smoogs@lemmy.world
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        14
        ·
        1 year ago

        You’re not owed nor entitled to an artist’s time and work for free.

        • Turun@feddit.de
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          3
          ·
          edit-2
          1 year ago

          Of course not, it’s the artists decision to put it on the internet for free.

          Technically that’s the root of the issue. This does not grant a license to everyone who looks at it, but if a license is required to train a model is unclear and currently discussed in court.

        • barsoap@lemm.ee
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          1 year ago

          I am perfectly entitled to type random stuff into google images, pick out images for a mood board and some as reference, regardless of their copyright status, thank you. Studying is not infringement.

          It’s what every artist does, it’s perfectly legal, and what those models do is actually even less infringing because they’re not directly looking at your picture of a giraffe and my picture of a zebra when drawing a zebra-striped giraffe, they’re doing it from memory.

            • barsoap@lemm.ee
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              1
              ·
              edit-2
              1 year ago

              And if you think that working with AI does not take effort you either did not try, or don’t have an artistic bone in your body. Randos typing “Woman with huge bazingas” into an UI and hitting generate don’t get copyright on the output, rightly so: Not just did they not do anything artistic, they also overlook all the issues with whatever gets generated because they lack the trained eye of an artist.

    • ElectroVagrant@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      3
      ·
      1 year ago

      Until the law catches up with the technology, people need ways of protecting themselves.

      I agree, and I wonder if the law might be kicked into catching up quicker as more companies try to adopt these tools and inadvertently infringe on other companies’ copyrighted material. 😅

    • 9thSun@midwest.social
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      6
      ·
      1 year ago

      How is training AI with art on the web different to a person studying art styles? I’d say if the AI is being monetized in some capacity, then sure maybe there should be laws in place. I’m just hard-pressed to believe that anyone can have sole control of anything once it gets on the Internet.

      • FooBarrington@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        1
        ·
        1 year ago

        I agree that the training isn’t fundamentally different, but that monetization of the output has to be controlled. The big difference between AI and humans is the speed with which they create - you have to employ an army of humans to match the output of a couple of GPUs. For noncommercial projects this is amazing. For commercial projects, it destroys the artists livelihoods.

        But this simply means that training shouldn’t be controlled, inference in commercial contexts should be.

      • Zeth0s@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        3
        ·
        edit-2
        1 year ago

        I work in AI and I believe it is different. Society is built to distribute wealth, so that everyone can live a decent life. People and AI should be treated differently in front of the law. Also, non-commercial, open source AI should be treated differently than commercial or closed source models

        • V H@lemmy.stad.social
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          2
          ·
          1 year ago

          Society is built to distribute wealth, so that everyone can live a decent life.

          As a goal, I admire it, but if you intend this as a description of how things are it’d be boundlessly naive.

      • rhombus@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1
        ·
        1 year ago

        The real issue comes in ownership of the AI models and the vast amount of labor involved in the training data. It’s taking what is probably hundreds of thousands of hours of labor in the form of art and converting it into a proprietary machine, all without compensating the artists involved. Whether you can make a comparison to a human studying art is irrelevant, because a corporation can’t own an artist, but they can own an AI and not have to pay it.

      • realharo@lemm.ee
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        8
        ·
        edit-2
        1 year ago

        How is training AI with art on the web different to a person studying art styles?

        Human brains clearly work differently than AI, how is this even a question?

        The term “learning” in machine learning is mainly a metaphor.

        Also, laws are written with a practical purpose in mind - they are not some universal, purely philosophical construct and never have been.

        • V H@lemmy.stad.social
          link
          fedilink
          English
          arrow-up
          8
          arrow-down
          2
          ·
          1 year ago

          Human brains clearly work differently than AI, how is this even a question?

          It’s not all that clear that those differences are qualitatively meaningful, but that is irrelevant to the question they asked, so this is entirely a strawman.

          Why does the way AI vs. the brain learn make training AI with art make it different to a person studying art styles? Both learn to generalise features that allows them to reproduce them. Both can do so without copying specific source material.

          The term “learning” in machine learning is mainly a metaphor.

          How do the way they learn differ from how humans learn? They generalise. They form “world models” of how information relates. They extrapolate.

          Also, laws are written with a practical purpose in mind - they are not some universal, purely philosophical construct and never have been.

          This is the only uncontroversial part of your answer. The main reason why courts will treat human and AI actions different is simply that they are not human. It will for the foreseeable future have little to do whether the processes are similar enough to how humans do it.

          • realharo@lemm.ee
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            8
            ·
            edit-2
            1 year ago

            Now you’re just cherry picking some surface-level similarities.

            You can see the difference in the process in the results, for example in how some generated pictures will contain something like a signature in the corner, simply because it resembles the training data - even though there is no meaning to it. Or how it is at least possible to get the model to output something extremely close to the training data - https://gizmodo.com/ai-art-generators-ai-copyright-stable-diffusion-1850060656.

            That at least proves that the process is quite different to the process of human learning.

            The question is how much those differences matter, and which similarities you want to focus on.

            Human learning is similar in some ways, but greatly differs in other ways.

            The fact that you’re picking and choosing which similarities matter and which don’t is just your arbitrary choice.

            • V H@lemmy.stad.social
              link
              fedilink
              English
              arrow-up
              9
              arrow-down
              2
              ·
              edit-2
              1 year ago

              You can see the difference in the process in the results, for example in how some generated pictures will contain something like a signature in the corner

              If you were to train human children on an endless series of pictures with signatures in the corner, do you seriously think they’d not emulate signatures in the corner?

              If you think that, you haven’t seen many children’s drawings, because children also often pick up that it’s normal to put something in the corner, despite the fact that to children pictures with signatures is a tiny proportion of visual input.

              Or how it is at least possible to get the model to output something extremely close to the training data

              People also mimic. We often explicitly learn to mimic - e.g. I have my sons art folder right here, full of examples of him being explicitly taught to make direct copies as a means to learn technique.

              We just don’t have very good memory. This is an argument for a difference in ability to retain and reproduce inputs, not an argument for a difference in methods.

              And again, this is a strawman. It doesn’t even begin to try to answer the questions I asked, or the one raised by the person you first responded to.

              That at least proves that the process is quite different to the process of human learning.

              Neither of those really suggests that all (that diffusion is different to humans learn to generalize images is likely true, what you’ve described does not provide even the start of any evidence of that), but again that is a strawman.

              There was no claim they work the same. The question raised was how the way they’re trained is different from how a human learns styles.

    • regbin_@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      arrow-down
      4
      ·
      edit-2
      1 year ago

      Disagree. It’s only unethical if you use it to generate the artist’s existing pieces and claim it as yours.

        • 9thSun@midwest.social
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 year ago

          I don’t see how AI training couldn’t be considered transformative as the whole idea is to consume input, break it down into data, and output something new. The way I’m understanding what you’re saying is like this: Instead of only paying royalties when I try to monetize a cover song, I’d have to pay every time I practiced it.

            • 9thSun@midwest.social
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              I don’t understand how you’re separating the the generated artworks from the AI that’s generating the work, but I do see your point. If a company puts out a tool for free I don’t think they should be on the hook for someone using that and creating a product. At the end of it all though, I think whoever has made any hard financial gains should should payout whoever contributed.