IncognitoErgoSum

joined 1 year ago
[–] IncognitoErgoSum 1 point 1 year ago (1 child)

> If I'm the "parent comment" you're referring to, then that's very much not my motivation.

You're not. I was talking about the thread parent: "Many things in life are a privilege for these groups. AI is no different." I should have been more specific.

At any rate, I personally feel that we have a moral responsibility to make it accessible to as many people as possible.

[–] IncognitoErgoSum 2 points 1 year ago (5 children)

As the technology improves, data centers that run AI will require significantly less cooling. GPUs aren't very power-efficient for AI work because they have to shuttle a lot of data back and forth between their memory and their processor cores. There are AI-specific cards in development that will let the huge matrix multiplications happen in place, without all that data movement, which will mean drastically lower power and cooling requirements.
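
To put rough numbers on that (these are commonly cited ballpark energy figures, not specs for any real card, so treat the sketch as illustrative):

```python
# Back-of-envelope: why moving weights costs more energy than the math.
# Energy figures are rough estimates in the spirit of Horowitz's ISSCC
# 2014 numbers for a 45nm process -- assumptions, not measurements.
DRAM_READ_PJ_PER_BYTE = 160.0   # ~640 pJ per 32-bit word read from DRAM
FLOP_PJ = 3.7                   # ~one 32-bit floating-point multiply

params = 7e9                    # hypothetical 7B-parameter model
bytes_moved = params * 2        # fp16 weights fetched once per token
flops = params * 2              # one multiply-accumulate per weight

move_energy_j = bytes_moved * DRAM_READ_PJ_PER_BYTE / 1e12
compute_energy_j = flops * FLOP_PJ / 1e12

print(f"data movement: ~{move_energy_j:.2f} J per token")
print(f"arithmetic:    ~{compute_energy_j:.2f} J per token")
# Movement comes out dozens of times more expensive than the math,
# which is why compute-in-memory designs try to skip the round trip.
```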

Also, these kinds of protestors are the same general group of people who stopped nuclear power from becoming a bigger player back in the 1960s and 70s. If we'd gone nuclear and replaced coal, we almost certainly wouldn't be sitting here at the beginning of what looks to be a major global warming event that's unlike anything we've ever seen before. It wouldn't have completely solved the problem, but it would have bought us time. An AI may be able to help us develop ideas to mitigate global warming, and it seems ridiculous to me to go all luddite and smash the machines over what will be a minuscule overall contribution to it given the possibility that it could help us solve the problem.

But let's be real here; these hypothetical people smashing the machines are doing it because they've bought into AI panic, not because they're afraid of global warming. If they really want to commit acts of ecoterrorism, there are much bigger targets.

[–] IncognitoErgoSum 2 points 1 year ago (3 children)

So clearly we do agree on most of this stuff, but I did want to point out a possibility you may not have considered.

> If we're just talking about what you can do, then these laws aren't going to matter because you can just pirate whatever training material you want.

This depends on the penalty and how strictly it's enforced. If it's enforced like normal copyright law, then you're right; your chances of getting in serious trouble just for downloading stuff are essentially nil -- the worst thing that will happen to you is your ISP will three-strikes you and you'll lose internet access. On the other hand, there's a lot of panic surrounding AI, and the government might use that as an excuse to pass laws that would give people prison time for possessing a model trained on pirated material, and then fund strict enforcement. I hope that doesn't happen, but with rumblings of insane laws that would give people prison time for using a VPN to watch a TV show outside of the country, I'm a bit concerned.

As for the parent comment's motivations, it's hard to say for sure with any particular individual, but I have noticed a pattern among neoliberals where they say things like "well, the rich are already powerful and we can't do anything about it, so why try" or "having universal health care, which the rest of the first world has implemented successfully, is unrealistic, so why try" and so on. It often boils down to giving lip service to progressive social values while steadfastly refusing to do anything that might actually make a difference. It's economic conservatism dressed as progressivism. Even if that's not what they meant (and it would be unwise of me to just assume that), I feel like that general attitude needs to be confronted.

[–] IncognitoErgoSum 8 points 1 year ago (1 child)

AI is more than just ChatGPT.

When we talk about reinterpreting copyright law in a way that makes AI training essentially illegal for anything useful, it also restricts smaller and potentially more focused networks. Researchers are finding that smaller networks can perform very well (not at the level of GPT-4, but well enough to be useful) if they're trained in a specific way, where the reasoning steps are spelled out in the training data.
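
For illustration, here's roughly what one record of that kind of training data might look like; the field names and the example itself are invented, not taken from any real dataset:

```python
# Hypothetical training record where the reasoning steps are written
# out rather than just the final answer. Format and field names are
# made up for illustration.
example = {
    "question": "A train leaves at 2:15 and arrives at 3:40. How long is the trip?",
    "reasoning": [
        "From 2:15 to 3:15 is 60 minutes.",
        "From 3:15 to 3:40 is another 25 minutes.",
        "60 + 25 = 85 minutes.",
    ],
    "answer": "85 minutes",
}

# At training time the steps get folded into the target text, so the
# model learns to produce the reasoning before the answer.
target = " ".join(example["reasoning"]) + " Answer: " + example["answer"]
print(target)
```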

Also, there are used Nvidia cards currently selling on Amazon for under $300 with 24 gigs of VRAM and AI performance almost equal to a 3090, which puts mixture-of-experts models (like a smaller version of GPT-4) within reach of people who aren't ultra-wealthy.
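
To give a sense of why 24 gigs matters, here's the rough memory math (the 20% runtime overhead is an assumption; actual numbers vary by software stack):

```python
# Ballpark VRAM needed to run a model at a given quantization level.
def model_vram_gb(params_billions, bits_per_weight, overhead=1.2):
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total * overhead / 1e9  # assumed 20% runtime overhead

for params, bits in [(13, 16), (13, 4), (30, 4)]:
    print(f"{params}B model at {bits}-bit: ~{model_vram_gb(params, bits):.1f} GB")
# 13B at fp16 (~31 GB) won't fit on a 24 GB card, but 4-bit quantization
# brings 13B (~8 GB) and even 30B (~18 GB) models comfortably under it.
```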

There's also the fact that there are plenty of companies currently working on hardware that will make AI significantly cheaper and more accessible to home users. Systems like ChatGPT aren't always going to be restricted to giant data centers, unless (as some people really want) laws are passed to prevent that hardware from being sold to regular people.

[–] IncognitoErgoSum 4 points 1 year ago (38 children)

> Losing their life because an AI has been improperly placed in a decision-making position because it was sold as having more capabilities than it actually has.

I would tend to agree with you on this one, although we don't need bad copyright legislation to deal with it; laws can address it directly. I would personally put in place a regulatory body that requires rigorous proof that an AI in those roles is significantly safer than a human, like the FDA does for medication.

> As for the average person who has the computer hardware and time to train an AI (bear in mind Google Bard and OpenAI use human contractors to correct misinformation in the answers as well as scanning), there is a ton of public domain writing out there.

Corporations would love it if regular people were only allowed to train their AIs on things that are 75 years out of date. Creative interpretations of copyright law aren't going to stop billion- and trillion-dollar companies from licensing things to train AI on, either by paying a tiny percentage of their war chests or by just ignoring the law altogether the way Meta always does and getting the customary slap on the wrist. What will end up happening is that Meta, Alphabet, Microsoft, Elon Musk and his companies, government organizations, etc. will all have access to AIs that know current, useful, and relevant things, and the rest of us will not, or we'll have to pay monthly for the privilege of access to a limited version of that knowledge, further enriching those groups.

> Furthermore, if they're using people's creativity to make a product, it's just WRONG not to have permission or to not credit them.

Let's talk about Stable Diffusion for a moment. Stable Diffusion models can be compressed down to about 2 gigabytes and still produce art. Stable Diffusion was trained on 5 billion images and fine-tuned on a subset of 600 million images, which means that the average image contributes 2 billion bytes divided by 600 million images, or a little over three bytes, to the final model. With the exception of a few mostly public domain images that appeared in the dataset hundreds of times, Stable Diffusion learned broad concepts from large numbers of images, similarly to how a human artist would learn art concepts. If people need permission to learn a teeny bit of information from each image (3 bytes of information isn't copyrightable, btw), then artists should have to get permission for every single image they put on their mood boards or use for inspiration, because they're taking orders of magnitude more than three bytes of information from each image they use for inspiration on a given work.
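
Spelling out that arithmetic:

```python
# The bytes-per-image math from above.
model_bytes = 2e9        # ~2 GB compressed Stable Diffusion checkpoint
training_images = 600e6  # fine-tuning subset of ~600 million images

print(f"~{model_bytes / training_images:.2f} bytes of weights per image")
# ~3.33 bytes -- far too little to store any part of an image, which is
# the point: the model keeps broad statistics, not copies.
```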

[–] IncognitoErgoSum 5 points 1 year ago (7 children)

Why do you think people will build data centers in Europe when they can build them elsewhere?

[–] IncognitoErgoSum 6 points 1 year ago (9 children)

For something to be a fact, it needs to actually be true. AI is currently accessible to everyone.

[–] IncognitoErgoSum 5 points 1 year ago

Wow, you have this all planned out, don't you?

If that's what Europe is like, they'll build their data centers somewhere else. Like the corrupt USA. Again, you'll be taking away your access to AI, not theirs.

[–] IncognitoErgoSum 10 points 1 year ago* (last edited 1 year ago) (11 children)

I'm not sure what you're getting at with this. It will only be a privilege for these groups if we choose to artificially make it that way. And why would you want to do that?

Do you want to give AI exclusively to the rich? If so, why?

[–] IncognitoErgoSum 3 points 1 year ago

Is kbin a place where we just call everyone we don't like "techbros"?

[–] IncognitoErgoSum 2 points 1 year ago

> Also, working in open source means having a proper understanding of licensing and ownership. Open source doesn't mean "free this and free that" -- in fact, many AI-based code assistance tools are actually hurting the open source initiative by not properly respecting the license of the code base it's studying from.

Don't be patronizing. I've been involved in open source for 20+ years, and I know plenty about licensing.

What you're talking about is changing copyright law so that you'll have to license content in order for an AI to learn concepts from that content (in other words, to be able to summarize it, learn facts from it, learn an art style, and so on). This isn't how copyright law currently works, and I hope to god it stays that way.

> For example, if you don't own the right of the original copy of Star Wars, you obviously wouldn't own any rights over the output of an upscaled Star Wars. Same goes for writing or other "transformative" media and it has been this way for a long time (see: audio sampling)

That's not the same thing as training an AI on Star Wars. If you feed Star Wars into an upscaling AI, the AI is processing each frame and creating an output that's a derivative work of that frame, and the result isn't something you would be allowed to release without a license. If you train an AI on Star Wars, it would learn general concepts from the movie, and it wouldn't be able to produce an upscaled version of the movie verbatim (although depending on the AI, it may be able to produce images in the general style of Star Wars or summarize the movie).

An appropriate analogy for what's going on here would be reading a book and then talking about the facts I learned from that book, which is in no way a violation of copyright law. If I started quoting long sections of that book verbatim, I would need a license from the author, but that's not how AI works. It's not learning the sentences those people type verbatim; it's picking up concepts and facts from them. Even if I were to memorize the book from cover to cover, I would be in the clear as long as I didn't actually start reproducing the book in some way. Neural networks are learning machines, not databases. Their purpose isn't to reproduce information verbatim.
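
If a sketch helps, here's that distinction in PyTorch-style Python; model, frame, and the rest are placeholders, not a real upscaler or training pipeline:

```python
import torch

# Processing: each frame flows through the model, and the output is
# derived directly from that frame -- a per-frame derivative work.
def upscale(model, frames):
    with torch.no_grad():
        return [model(frame) for frame in frames]

# Training: a frame only nudges the weights via gradients, then gets
# discarded. What persists is a tiny statistical adjustment spread
# across millions of parameters, not a copy of the frame.
def train_step(model, optimizer, frame, target):
    loss = torch.nn.functional.mse_loss(model(frame), target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```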

If you're still not clear on the difference between training on data and processing it, let me know and I'll try to clarify further.
