0 points

As I understand it, this is only about using search results for summaries. If it’s just that and links to the source, I think it’s OK. What would be absolutely unacceptable is to use the web in general as training data for text and image generation (=write me a story about topic XY).

permalink
report
reply
14 points

If it’s just that and links to the source, I think it’s OK.

No one will click on the source, which means the only visitor to your site is Googlebot.

What would be absolutely unacceptable is to use the web in general as training data for text and image generation.

This has already happened and continues to happen.

permalink
report
parent
reply
7 points

No one will click on the source, which means the only visitor to your site is Googlebot.

That was the argument with the text snippets from news sources. Publishers successfully lobbied for laws to be passed in many countries that required search engine operators to pay fees. It backfired when Google removed the snippets from news sources that demanded fees from Google. Their visitors dropped by a massive amount, 90% or so, because those results were less attractive to Google users to click on than the nicer results with a snippet and a thumbnail. So “No one will click on the source” has already been disproven 10 or so years ago when the snippet issue was current. All those publishers have entered a free of charge licensing agreement with Google and the laws are still in place. So Google is fine, upstart search engines are not because those cannot pressure the publishers into free deals.

This has already happened and continues to happen.

With Gemini?

permalink
report
parent
reply
4 points
*
Deleted by creator
permalink
report
parent
reply
2 points

The context is not the same. A snippet is incomplete and often lacking important details. It’s minimally tailored to your query unlike a response generated by an LLM. The obvious extension to this is conversational search, where clarification and additional detail still doesn’t require you to click on any sources; you simply ask follow up questions.

With Gemini?

Yes. How do you think the Gemini model understands language in the first place?

permalink
report
parent
reply
3 points

that latter will be the case rather sooner than later I’m afraid. It’s just a matter of time with Google.

permalink
report
parent
reply
1 point

that latter will be the case rather sooner than later I’m afraid. It’s just a matter of time with Google.

If that will actually be the case and passes legal challenges, basically all copyright can be abolished which would definitively have some upsides but also downsides. All those video game ROM decompilation projects would be suddenly in the clear, as those are new source code computer-generated from copyrighted binary code, so not really different from a AI generated image based on a copyrighted image used as training data. We could also ask Gemini write a full-length retelling of Harry Potter and just search, replace all trademarked names, and sell that shit. Evil companies could train an AI on GNU/Linux source codes and tell it to write an operating system. Clearly derived work from GPL code but without any copyright to speak of, all that generated code could be legally closed. I don’t like that.

permalink
report
parent
reply
1 point

I really hope those ROM sites will be cleared sooner than later. It hurt a lot to see some of the biggest ROM sites force to close. Please sign: https://citizens-initiative.europa.eu/initiatives/details/2024/000007_en

permalink
report
parent
reply
243 points
Deleted by creator
permalink
report
reply
23 points
  1. Say no
  2. You don’t show up in Google search results
  3. You still show up in other search results
  4. Google is no longer bringing the best results
  5. People stop using your site
  6. You lose
permalink
report
parent
reply
41 points

Part 5 is where I don’t see this actually going.

Look at twitter. Now look at mastodon. Tell me which one is more shitty. Now tell me which one has something like 85% of the market, and which one most people haven’t heard of.

Just because something it better, doesn’t mean people use it. You can fit all of Lemmy in the world in one of the larger NBA size arenas. You can’t even fit twitters total user base into some smaller CITIES.

permalink
report
parent
reply
1 point

Twitter will be dead within 1-2 years, Elon will make sure of that

permalink
report
parent
reply
14 points

He’s already owned it for nearly two years. I’d definitely take the over on that bet. I just don’t see what Twitter could possibly do that they haven’t done already to kill it?

permalink
report
parent
reply
8 points

God I hope you’re right, but doubt you are.

permalink
report
parent
reply
2 points

I think the amount of people who are familiar with search engine options besides Google is quite a bit larger than the population of Lemmy. (It fuckin better be, anyway)

permalink
report
parent
reply
105 points

Google results are actually already pretty terrible. They just have tremendous inertia.

permalink
report
parent
reply
7 points

We all keep saying this but can anybody point me to which one is better?

I invariably end up having to go back to them because the other search engines all have their own problems.

The issue is the internet is polluted with SEO and all the useful things that used to be spread out are now condensed onto places like Reddit, or places that aren’t even being indexed.

permalink
report
parent
reply
9 points

Supposedly there’s a paid one that is good. I haven’t tried. The thing is Google is completely enshittified. They don’t have to care about you or the sites you search. So my theory is Bing is better because they are hungrier and anything that takes away market share from Google is good—but I’m fully aware that Microsoft was just as shitty as Google and will be again if they get back on top.

Everything else I know of is either just an alternate front end for one of them or an aggregator of both. So you’re right, there’s precious little alternative to Google. But it’s almost bad enough I’m ready for the return of web rings of good sites vouching for each other.

permalink
report
parent
reply
24 points

I stopped using them months ago. I only notice when I’m looking for places (e.g., restaurants, barbers).

I’m not unhappy but may still shop around.

permalink
report
parent
reply
6 points

yeah, I appreciate the push towards more privacy-centric search engines but as a result searches that are relevant to me geographically on places like startpage are next to useless. I understand why but I wish that local results were a bit better on the alternatives.

permalink
report
parent
reply
24 points

I’m pretty pessimistic about this:

  1. Say no
  2. Google still scrapes your site to train their AI
  3. People don’t care that its wrong, still use Google instead of other search engines
permalink
report
parent
reply
14 points

Unfortunately, the vast majority of people do not give a single fuck and they will use whatever is preinstalled on their device

permalink
report
parent
reply
-2 points

I’m not sure of the advantages of showing up in Google search results. It seems like something that I wouldn’t want to happen anyway.

permalink
report
reply
3 points

Bing to finally overtake Google? Inconceivable!

permalink
report
reply
2 points
Deleted by creator
permalink
report
parent
reply
59 points

We’re at a point where not only should the Internet be classified as a utility, so should Search.

permalink
report
reply
26 points
*

Yeah, it’s not just e.g. water that is the utility, pipes and pumping stations are part of it. Otherwise you have water…uh…somewhere, go get it yourself.

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 15K

    Posts

  • 650K

    Comments