Maybe this is common knowledge, but I had no idea before. What an absolutely horrible decision from google to allow this. What are they thinking?? This is great for phishing and malware, but I don't know what else. (Yeah ok, the reason has probably something to do with "line must go up".)
TechTakes
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
I recall seeing something of this sort happening on goog for about 12~18mo - every so often a researcher post does the rounds where someone finds Yet Another way goog is fucking it up
the advertising dept has completely captured all mindshare and it is (demonstrably) the only part that goog-the-business cares about
CIDR 2025 is ongoing (Conference on Innovative Data Systems Research). It's a very good conference in computer science, specifically database research (an equivalent of a journal for non-CS science). And they have a whole session on LLMs called "LLMs ARE THE NEW NO-SQL"
I didn't have time to read the papers yet, believe me I will, but the abstracts are spicy
We systematically develop benchmarks to study [the problem] and find that standard methods answer no more than 20% of queries correctly, confirming the need for further research in this area.
(Text2SQL is Not Enough: Unifying AI and Databases with TAG, Biswal et al.)
Hey guys and gals, I have a slightly different conclusion, maybe a baseline 20% correctness is a great reason to not invest a second more of research time into this nonsense? Jesus DB Christ.
I'd also like to shoutout CIDR for setting up a separate "DATABASES AND ML" session, which is an actual research direction with interesting results (e.g. query optimizers powered by an ML model achieving better results than conventional query optimizers). At least actual professionals are not conflating ML with LLMs.
Polish commentary on Hitlergruß: https://bsky.app/profile/smutnehistorie.bsky.social/post/3lgaoyezhgc2c
Translation:
- it’s just a Hindu symbol of prosperity
- a normal Roman salute
- regular rail car
- wait a second
From the "flipping through LessWrong for entertainment" department:
What effect does LLM use have on the quality of people's thinking / knowledge?
- I'd expect a large positive effect from just making people more informed / enabling them to interpret things correctly / pointing out fallacies etc.
You'd think the AI safety chuds would have more reservations about using GPT, which they believe has sapience, to learn things. They have the concept of an AI being a good convincer, which, hey, idiots, how have none of you thought the great convincing has started? Also, how have none of you realised that maybe you should be a little harder to convince in general???
Reposting this for the new week thread since it truly is a record of how untrustworthy sammy and co are. Remember how OAI claimed that O3 had displayed superhuman levels on the mega hard Frontier Math exam written by Fields Medalist? Funny/totally not fishy story haha. Turns out OAI had exclusive access to that test for months and funded its creation and refused to let the creators of test publicly acknowledge this until after OAI did their big stupid magic trick.
From Subbarao Kambhampati via linkedIn:
"𝐎𝐧 𝐭𝐡𝐞 𝐬𝐞𝐞𝐝𝐲 𝐨𝐩𝐭𝐢𝐜𝐬 𝐨𝐟 “𝑩𝒖𝒊𝒍𝒅𝒊𝒏𝒈 𝒂𝒏 𝑨𝑮𝑰 𝑴𝒐𝒂𝒕 𝒃𝒚 𝑪𝒐𝒓𝒓𝒂𝒍𝒍𝒊𝒏𝒈 𝑩𝒆𝒏𝒄𝒉𝒎𝒂𝒓𝒌 𝑪𝒓𝒆𝒂𝒕𝒐𝒓𝒔” hashtag#SundayHarangue. One of the big reasons for the increased volume of “𝐀𝐆𝐈 𝐓𝐨𝐦𝐨𝐫𝐫𝐨𝐰” hype has been o3’s performance on the “frontier math” benchmark–something that other models basically had no handle on.
We are now being told (https://lnkd.in/gUaGKuAE) that this benchmark data may have been exclusively available (https://lnkd.in/g5E3tcse) to OpenAI since before o1–and that the benchmark creators were not allowed to disclose this *until after o3 *.
That o3 does well on frontier math held-out set is impressive, no doubt, but the mental picture of “𝒐1/𝒐3 𝒘𝒆𝒓𝒆 𝒋𝒖𝒔𝒕 𝒃𝒆𝒊𝒏𝒈 𝒕𝒓𝒂𝒊𝒏𝒆𝒅 𝒐𝒏 𝒔𝒊𝒎𝒑𝒍𝒆 𝒎𝒂𝒕𝒉, 𝒂𝒏𝒅 𝒕𝒉𝒆𝒚 𝒃𝒐𝒐𝒕𝒔𝒕𝒓𝒂𝒑𝒑𝒆𝒅 𝒕𝒉𝒆𝒎𝒔𝒆𝒍𝒗𝒆𝒔 𝒕𝒐 𝒇𝒓𝒐𝒏𝒕𝒊𝒆𝒓 𝒎𝒂𝒕𝒉”–that the AGI tomorrow crowd seem to have–that 𝘖𝘱𝘦𝘯𝘈𝘐 𝘸𝘩𝘪𝘭𝘦 𝘯𝘰𝘵 𝘦𝘹𝘱𝘭𝘪𝘤𝘪𝘵𝘭𝘺 𝘤𝘭𝘢𝘪𝘮𝘪𝘯𝘨, 𝘤𝘦𝘳𝘵𝘢𝘪𝘯𝘭𝘺 𝘥𝘪𝘥𝘯’𝘵 𝘥𝘪𝘳𝘦𝘤𝘵𝘭𝘺 𝘤𝘰𝘯𝘵𝘳𝘢𝘥𝘪𝘤𝘵–is shattered by this. (I have, in fact, been grumbling to my students since o3 announcement that I don’t completely believe that OpenAI didn’t have access to the Olympiad/Frontier Math data before hand… )
I do think o1/o3 are impressive technical achievements (see https://lnkd.in/gvVqmTG9 )
𝑫𝒐𝒊𝒏𝒈 𝒘𝒆𝒍𝒍 𝒐𝒏 𝒉𝒂𝒓𝒅 𝒃𝒆𝒏𝒄𝒉𝒎𝒂𝒓𝒌𝒔 𝒕𝒉𝒂𝒕 𝒚𝒐𝒖 𝒉𝒂𝒅 𝒑𝒓𝒊𝒐𝒓 𝒂𝒄𝒄𝒆𝒔𝒔 𝒕𝒐 𝒊𝒔 𝒔𝒕𝒊𝒍𝒍 𝒊𝒎𝒑𝒓𝒆𝒔𝒔𝒊𝒗𝒆–𝒃𝒖𝒕 𝒅𝒐𝒆𝒔𝒏’𝒕 𝒒𝒖𝒊𝒕𝒆 𝒔𝒄𝒓𝒆𝒂𝒎 “𝑨𝑮𝑰 𝑻𝒐𝒎𝒐𝒓𝒓𝒐𝒘.”
We all know that data contamination is an issue with LLMs and LRMs. We also know that reasoning claims need more careful vetting than “𝘸𝘦 𝘥𝘪𝘥𝘯’𝘵 𝘴𝘦𝘦 𝘵𝘩𝘢𝘵 𝘴𝘱𝘦𝘤𝘪𝘧𝘪𝘤 𝘱𝘳𝘰𝘣𝘭𝘦𝘮 𝘪𝘯𝘴𝘵𝘢𝘯𝘤𝘦 𝘥𝘶𝘳𝘪𝘯𝘨 𝘵𝘳𝘢𝘪𝘯𝘪𝘯𝘨” (see “In vs. Out of Distribution analyses are not that useful for understanding LLM reasoning capabilities” https://lnkd.in/gZ2wBM_F ).
At the very least, this episode further argues for increased vigilance/skepticism on the part of AI research community in how they parse the benchmark claims put out commercial entities."
Big stupid snake oil strikes again.
Every time they go 'this wasnt in the data' it turns out it was. A while back they did the same with translating rareish languages. Turns out it was trained on it. Fucked up. But also, wtf how are they expecting this to stay secret and there being no backlash? This world needs a better class of criminals.
The conspiracy theorist who lives in my brain wants to say its intentional to make us more open to blatant cheating as something that's just a "cost of doing business." (I swear I saw this phrase a half dozen times in the orange site thread about this)
The earnest part of me tells me no, these guys are just clowns, but I dunno, they can't all be this dumb right?
holy shit, that’s the excuse they’re going for? they cheated on a benchmark so hard the results are totally meaningless, sold their most expensive new models yet on the back of that cheated benchmark, further eroded the scientific process both with their cheating and by selling those models as better for scientific research… and these weird fucks want that to be fine and normal? fuck them
they can't even sell o3 really - in o3 high mode, needed to do this level of query, it's about $1000 per query lol
til that there's not one millionaire with family business in south african mining in current american oligarchy, but at least two. (thiel's father was an exec at mine in what is today Namibia). (they mined uranium). (it went towards RSA nuclear program). (that's easily most ghoulish thing i've learned today, but i'm up only for 2h)
Good news, everyone: critihype is canceled until the next tweet.
Hmm, surely there is no downside to doing all of one's marketing, both personal* and professional, through the false certainty and low signal of short-form social media. The leopard has only licked Sam's face, it will never bite and begin chewing!
*You and I may find the concept of a "personal brand" to be horrifying, but these guys clearly want to become brands more fervently than Bruce Wayne wanted to become a bat
This is a thought I've been entertaining for some time, but this week's discussion about Ars Technica's article on Anthropic, as well as the NIH funding freeze, finally prodded me to put it out there.
A core strategic vulnerability that Musk, his hangers-on, and geek culture more broadly haven't cottoned onto yet: Space is 20th-century propaganda. Certainly, there is still worthwhile and inspirational science to be done with space probes and landers; and the terrestrial satellite network won't dwindle in importance. I went to high school with a guy who went on to do his PhD and get into research through working with the first round of micro-satellites. Resources will still be committed to space. But as a core narrative of technical progress to bind a nation together? It's gassed. The idea that "it might be ME up there one day!" persisted through the space shuttle era, but it seems more and more remote. Going back to the moon would be a remake of an old television show, that went off the air because people ended up getting bored with it the first time. Boots on Mars (at least healthy boots with a solid chance to return home) are decades away, even if we start throwing Apollo money at it immediately. The more outlandish ideas like orbital data centers and asteroid mining don't have the same inspirational power, because they are meant to be private enterprises operated by thoroughly unlikeable men who have shackled themselves to a broadly destructive political program.
For better or worse, biotechnology and nanotechnology are the most important technical programs of the 21st century, and by backgrounding this and allowing Trump to threaten funding, the tech oligarchs kowtowing to him right now are undermining themselves. Biotech should be obvious, although regulatory capture and the impulse for rent-seeking will continue to hold it back in the US. I expect even more money to be thrown at nanotechnology manufacturing going into the 2030s, to try to overcome the fact that semiconductor scaling is hitting a wall, although most of what I've seen so far is still pursuing the Drexlerian vision of MEMS emulating larger mechanical systems... which, if it's not explicitly biocompatible, is likely going down a cul-de-sac.
Everybody's looking for a positive vision of the future to sell, to compete with and overcome the fraudulent tech-fascists who lead the industry right now. A program of accessible technology at the juncture of those two fields would not develop overnight, but could be a pathway there. Am I off base here?
This seems like yet another disconnect between however the fuck science communication has been failing the general public and myself.
Like when you say space I think, fuck yeah, space! Those crisp pictures of Pluto! Pictures of black holes! The amazing JWST data! Gravitational waves detection! Recreating the conditions of the early universe in particle accelerators to unlock the secrets of spacetime! Just most amazing geek shit that makes me as excited as I was when I was 12 looking at the night sky through my cheap-ass telescope.
Who gives a single fuck about sending people up there when we have probes and rovers, true marvels of engineering, feeding us data back here? Did you know Voyager 1, Voyager Fucking ONE, almost 50 years old probe, over 150 AU away from Earth, is STILL SENDING US DATA? We engineered the fuck of that bolt bucket so that even the people that designed it are surprised by how long it lasted. You think a human would last 50 years in the interstellar medium? I don't fucking think so.
We're unlocking the secrets of the universe and confirming theories from decades ago, has there been a more exciting time to be a scientist? Wouldn't you want to run a particle accelerator? Do science on the ISS? Be the engineer behind the next legendary probe that will benefit mankind even after you're gone? If you can't spin this into a narrative of technical progrees and humans being amazing then that's a skill issue, you lack fucking whimsy.
And I don't think there's a person in the world less whimsical than Elon fucking Musk.
Agree with space travel being retro-futurist fluff. It's very rich men badly remembering mediocre science fiction.
The US could lead the world in innovation in green technology but that's now tainted by wokeness.
https://xcancel.com/kailentit/status/1881476039454699630
"We did not have superintelligent relations with that…"
Rationalist death count keeps climbing https://xcancel.com/jessi_cata/status/1882182975804363141#m
What the fuck?
The agents were conducting a routine roving patrol when they stopped Bauckholt and a female in the town close to the border. During a records check, the unidentified female occupant was removed from the vehicle for further questioning, broke free, and began shooting at the agents, the incident report shows.
After the female suspect was hit by return fire, Bauckholt emerged from the vehicle and also began firing on the agents. He sustained gunshot wounds and was pronounced dead.
....... What the fuck?
Jesus wept, it's so frustratingly obvious that anytime some flavor of cop kills someone, the news media reporting (if any) will be this weird Yoda grammar pidgin.
The femoidically gendered female shot with its gun by very personally pulling the trigger, with this viscerally physical action performed by the said femalian in most pointedly concrete terms amounting to it (the femaloidistical entity, a specimen of the species known as females) firing lethal gunshots at the border patrol with the female's own two hands.
Subsequently return fire manifested itself from somewhere and came into contact with the female suspect female. The Justice Enforcement Officers involved in the situation were made a part of a bilateral exchange of gunfire between the shooting female and the officers situated in the scenario in which shooting was, to some extent, quite possibly performed from their side as well.
hackernews: We're going to build utopia on Mars, reinvent money, and construct god.
also hackernews: moving off facebook is too hard :( :( :(
You may have heard that Catturd doesn't have any fiber in his diet and was hospitalized for bowel blockage. (Best sneer I've seen so far: "can't turd.") Along similar lines, Srid isn't taking his statins for high cholesterol caused by a carnivore diet.
Meta: I'm kind of pissed that Catturd is WP notable but laughing my ass off at the page for carnivore diets. Life takes and gives.
My favorite part of the carnivore diet is that apparently scurvy can become enough of a problem that you'll see references to "not wanting to start the vitamin C debate" in forums.
I'm pretty sure it's not just a me thing, but I thought we all knew that sailors kept citrus on board specifically to prevent scurvy by providing vitamin C and that we all learned about this as kids when either a teacher tried to make the colonial era interesting or we got vaguely curious about pirates at some point.
So that's how to translate "Yo, this diet is for chumps" into Wikipedian.
so I ran into this fucking garbage earlier, which goes so hard on the constituent parts of "the spam is the point", an ouroborosian self-reinforcing loop of Just More Media Bro Just One More Video Bro You'll See Bro It'll Be The Best Listicle Bro Just Watch Bro, and the insufferably cancerous "the medium is the message" videos-made-for-youtube-because-youtube that if it were a voltron it'd probably have its own unique Special Moment sequence instead of being one of the canned assembly shots
various topics (e.g., AI news, crypto, fitness, personal finance)
That sure is a specifc selection of topics.
Banner start to the next US presidency, with Wiener Von Wrong tossing a Nazi salute and the ADL papering that one over as an "awkward gesture". 2025 is going to be great for my country.
Incidentally is "Wiener Von Wrong" or "Wernher Von Brownnose" better?
I hope everyone is ready for the constant overlap between politics and AI / Silicon Valley; because I'm not.
Trump Admin Accused of Using AI to Draft Executive Orders (Source Bluesky Thread).
I'm not 100% sure I buy that the EOs were written by AI rather than people who simply don't care about or don't know the details; but it certainly looks possible. Especially that example about the Gulf of Mexico. Either way I am heartened that this is the conclusion people jump to.
Aside: I also like how much media is starting to cite bluesky (and activitypub to a lesser extent). I assume a bunch of journalists moved off of twitter or went multi-platform.
following on from this comment, it is possible to get it turned off for a Workspace Suite Account
- contact support (
?
button from admin view) - ask the first person to connect you to
Workspace Support
(otherwise you'll get some made-up bullshit from a person trying to buy time or Case Success or whatever, simply because they don't have the privileges to do what you're asking) - tell the referred-to person that you want to enable controls for "Gemini for Google Workspace" (optionally adding that you have already disabled "Gemini App")
hopefully you spend less time on this than the 40-something minutes I had to (a lot of which was spent watching some poor support bastard start-stop typing for minutes at a time because they didn't know how to respond to my request)
You gotta love how in the announcement the guy is so blatantly "hey they said and did such nice things for me that I just got a throw them a bone, and if releasing the leader of a notorious drug bazaar who tried to put out a hit on one of his employees is what they want then they can have it!"
Sidenote: AFAIK, even with this pardon, Ulbricht still ended up spending more time in prison than if he took a plea deal he was reportedly offered:
He was offered a plea deal, which would have likely given him a decade-long sentence, with the ability to get out early on good behavior. Worst-case scenario, he would have spent five years in a medium-security prison and been freed.
Gotta say, this whole situation's reminding me of SBF - both of them thought they could outsmart the Feds, and both received much harsher sentences than rich white collar criminals usually get as a result.