BigMuffin69

joined 1 year ago
BigMuffin69@awful.systems 9 points 1 day ago (1 children)

I no longer remember what this man actually looks like

In b4 there's a 100k word essay on LW about how intentionally crashing the economy will dry up VC investment in "frontier AGI labs" and thus will give the 🐀s more time to solve "alignment" and save us all from big 🐍 mommy. Therefore, MAGA harming every human alive is in fact the most effective altruism of all! Thank you Musky, I just couldn't understand your 10,000 IQ play.

BigMuffin69@awful.systems 9 points 2 days ago* (last edited 2 days ago)

Mr. President, this is simply too much winning, I cannot stand the winning anymore 😭

BigMuffin69@awful.systems 15 points 2 days ago* (last edited 2 days ago) (9 children)

Tech stonks continuing to crater 🫧 🫧 🫧

I'm sorry for your 401Ks, but I'd pay any price to watch these fuckers lose.

spoiler(mods let me know if this aint it)

BigMuffin69@awful.systems 0 points 1 week ago* (last edited 1 week ago)

To be fair, you have to have a really high IQ to understand why my ouija board writing " A " " S " " S " is not an existential risk. Imo, this shit about AI escaping just doesn't have the same impact on me after watching Claude's reasoning model fail to escape from Mt Moon for 60 hours.

BigMuffin69@awful.systems 4 points 1 week ago* (last edited 1 week ago) (2 children)

was just in a chat room with an anthropic employee and she said, "if you have a solution for x, we are hiring" and before I could even say, "why would I want to work for a cult?" she literally started saying "some people underestimate the super exponential of progress"

To which I replied, "the only super exponential I'm seeing rn is Anthropic's negative revenue."

BigMuffin69@awful.systems 5 points 2 weeks ago* (last edited 2 weeks ago)

One more tidbit: I checked in and it's been stuck on the first floor of Mt Moon for 6 hours. Just out of curiosity, I asked an OAI model "what do I do if im stuck in mount moon 1F" and it spit out a step-by-step guide on how to navigate the cave, with the location of each exit and what to look for. So yeah, even without someone hardcoding hints in the model, just knowing the game state and querying what's next suffices to get the next step to progress the game.

BigMuffin69@awful.systems 9 points 2 weeks ago (2 children)

"Even teenage delinquents and homeless beggars love it. The only group that gives me hateful looks is the radical socialists."

BigMuffin69@awful.systems 8 points 2 weeks ago* (last edited 2 weeks ago)

I had a similar discussion with one of my friends! Anthropic is bragging that the model was not trained to play Pokemon, but Pokemon Red has massive wikis for speedrunning that, based on the reasoning traces, are clearly in the training data. Like the model trace said it was "training a nidoran to level 12 b.c. at level 12 nidoran learns double kick which will help against brock's rock type pokemon", so it's not going totally blind in the game. There were also a couple of outputs when it got stuck for several hours where it started printing things like "Based on the hint..." which seemed kind of sus. I wouldn't be surprised if there is some additional hand holding going on in the back based on the game state (i.e., go to Oak's, get a starter, go north to Viridian, etc.) that helps guide the model. In fact, I'd be surprised if this wasn't the case.

BigMuffin69@awful.systems 7 points 2 weeks ago* (last edited 2 weeks ago) (3 children)

So they had the new Claude hooked up to some tools so that it could play Pokemon Red. Somewhat impressive (at least to me!): it was able to beat Lt. Surge after several days of play. They had a stream demo'ing it on Twitch and, despite the on-paper result of getting 3 gym badges, the poor fella got stuck in Viridian Forest trying to find the exit to the maze.

As far as finding the exit goes... I guess you could say he was stumped? (MODS PLEASE DONT BAN)

Strim if anyone is curious. Yes, I know this is clever advertising for Anthropic, but I do find it cute and maybe someone else will?

https://www.twitch.tv/claudeplayspokemon

BigMuffin69@awful.systems 9 points 2 weeks ago (13 children)

Bruh, Big Yud was yapping that this means the orthogonality thesis is false and mankind is saved b.c. of this. But then he immediately retreated to, "we are all still doomed b.c. recursive self-improvement." I wonder what it's like to never have to update your priors.

Also, I saw other papers that showed almost all prompt rejection responses shared common activation weights and tweaking them can basically jailbreak any model, so what is probably happening here is that by finetuning to intentionally make malicious code, you are undoing those rejection weights. And until this is reproduced by non-safety-cranks, I'm pressing X to doubt.

BigMuffin69@awful.systems 6 points 1 month ago* (last edited 1 month ago)

Lmaou. "We need to alignment pill the Russian youth." Fast forward to the year 20XX and the haunted alignment pilled adults are now 'aligning' their bots to the world's top nuclear armed despot.

tony_soprano_how_could_this_happen.jpg (for some reason awful systems won't let me upload pictures anymore (ノಠ益ಠ)ノ)

Holy Moses in heaven, iirc both Sam and Dario have said that their urge to build the torment nexus came from being inspired by online RAT forums. Maybe alignment 'pilling' youths is counterproductive to human flourishing? As the LWers say, "update your priors fuckheads"
