this post was submitted on 06 Jul 2023
20 points (100.0% liked)

Singularity | Artificial Intelligence (ai), Technology & Futurology

660 readers
9 users here now

About:

This sublemmy is a place for sharing news and discussions about artificial intelligence, core developments of humanity's technology and societal changes that come with them. Basically futurology sublemmy centered around ai but not limited to ai only.

Rules:
  1. Posts that don't follow the rules and don't comply with them after being pointed out that they break the rules will be deleted no matter how much engagement they got and then reposted by me in a way that follows the rules. I'm going to wait for max 2 days for the poster to comply with the rules before I decide to do this.
  2. No Low-quality/Wildly Speculative Posts.
  3. Keep posts on topic.
  4. Don't make posts with link/s to paywalled articles as their main focus.
  5. No posts linking to reddit posts.
  6. Memes are fine as long they are quality or/and can lead to serious on topic discussions. If we end up having too much memes we will do meme specific singularity sublemmy.
  7. Titles must include information on how old the source is in this format dd.mm.yyyy (ex. 24.06.2023).
  8. Please be respectful to each other.
  9. No summaries made by LLMs. I would like to keep quality of comments as high as possible.
  10. (Rule implemented 30.06.2023) Don't make posts with link/s to tweets as their main focus. Melon decided that the content on the platform is going to be locked behind login requirement and I'm not going to force everyone to make a twitter account just so they can see some news.
  11. No ai generated images/videos unless their role is to represent new advancements in generative technology which are not older that 1 month.
  12. If the title of the post isn't an original title of the article or paper then the first thing in the body of the post should be an original title written in this format "Original title: {title here}".
  13. Please be respectful to each other.

Related sublemmies:

[email protected] (Our community focuses on programming-oriented, hype-free discussion of Artificial Intelligence (AI) topics. We aim to curate content that truly contributes to the understanding and practical application of AI, making it, as the name suggests, “actually useful” for developers and enthusiasts alike.)

Note:

My posts on this sub are currently VERY reliant on getting info from r/singularity and other subreddits on reddit. I'm planning to at some point make a list of sites that write/aggregate news that this subreddit is about so we could get news faster and not rely on reddit as much. If you know any good sites please dm me.

founded 1 year ago
MODERATORS
 

Article: https://gizmodo.com/google-says-itll-scrape-everything-you-post-online-for-1850601486

Article summarizing the article above: https://gizmodo.com/google-says-itll-scrape-everything-you-post-online-for-1850601486

Copy of the summarization:

Google has updated its privacy policy to explicitly state it can use virtually anything you post online to enhance its AI tools, a change that raises intriguing privacy questions and has prompted reactions from platforms such as Twitter and Reddit.

Google's New Privacy Policy: Google has altered its privacy policy to state that it can scrape almost any content posted online for the advancement of its AI tools.

· It uses this data to improve existing services and develop new products, features, and technologies.

· The data harvested aids in training Google's AI models and building products like Google Translate, Bard, and Cloud AI.

Impact on Internet Users: This policy modification challenges conventional concepts of online privacy.

· It suggests that any public post on the internet could be used by Google

· This practice necessitates a shift in how we perceive online activity, focusing on how the information could be employed rather than who can see it.

Legal and Copyright Concerns: The usage of data from the internet to fuel AI systems raises legal and copyright issues.

· It remains uncertain whether such a practice is legal, with courts likely to address these new copyright issues in the coming years.

· This practice affects consumers in surprising ways, raising questions about data ownership.

Reactions from Other Platforms: Twitter and Reddit have responded to this AI-related issue by restricting access to their APIs.

· This action aimed to protect their intellectual property from data scraping but resulted in breaking third-party tools used to access these platforms.

· Controversies have ensued, such as Twitter contemplating charging public entities for tweets, and Reddit seeing a mass protest due to API changes disrupting the work of moderators.

Elon Musk's Stance on Web Scraping: Elon Musk has recently expressed concerns about web scraping.

· He blamed several Twitter mishaps on the company's need to prevent others from data extraction.

· Despite these claims, most IT experts believe these problems are likely due to management issues or technical difficulties.

top 4 comments
sorted by: hot top controversial new old
[–] [email protected] 5 points 1 year ago (2 children)

I don't know why we should be surprised. This is just natural progression and even if they don't explicitly send it out to a specific location on its own eventually it will find and consume the information in one way or another unless it is kept in an isolated box. There is absolutely no legislation that will fix/stop this unless 100% of all people capable of creating these will respect such a decision and being the odd groups out will severely hamper their future.

[–] [email protected] 2 points 1 year ago (1 children)

Suppose there is legislation in the future; should it opt-in only or opt-out?

I think you'd have two very different data sets to train on there. Im not sure opt-out is better than no choice. These AI models are going to be out there doing things for us, i think we’re all better off if they have the highest quality of data to train on.

[–] [email protected] 0 points 1 year ago

Opt out, we already have robots.txt for this and would work just as well as any legislation.

You will never be able to tell if bad actors are violating your ToS or the law unless they keep raw data stored for inspection.

Even if AIPrime respects your wishes and doesn't scrape your site I potentially have the ability to feed it anything I have access to. I won't claim to have the largest data library on earth but I have been data hoarding since the mid 90s and would gladly share it with a company I feel is aligned with my beliefs.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago)

Rather than it being surprising it's just a post about their official statement on their position with this stuff where they say that they WILL do those things which is interesting.

load more comments
view more: next ›