M.A.D.'s response to LLMs
M.A.D.'s response to LLMs
M.A.D.'s response to LLMs
You're viewing a single thread.
LLMs are Large Language Models and generate text, not images.
(ok, LLMs can't count either but still)
I was lazy the other day and I asked gemini to set an alarm for me, then asked how long it was until that alarm. Not even fucking close to the right amount of time. I figured it would be smart enough to just subtract for me..
Should have said 8 hours and 2 mins... Nope, just made up a time
Alarm set for 37:80 repeating every Blernsday
Since when can Gemini do anything regarding your phone? I've tried to do similar thing like a month ago, and it did say that it doesn't support these actions.
Apparently they are adding more actions with time. At some point maybe I'll be bothered to try it again
I think what they are talking about is the Gemini app which was installed from the play store it likely doesn't have access to everything. Android put a update that integrated it to replace androids assistant. I assume it is the same one they are rolling into the Google Home and whatever so they all work the same and sync through the Home app. (Haven't tried any of that, I keep off most everything these days. Used to love playing with new tech... Now I'm just tired)
It replaced the Google assistant in standard android. Overall it is a worse experience for me so far. It tries to give much more information and isn't as easy to navigate to me. Most common things I would do is set alarms, say things like "directions to Orange county DMV" or wherever. The alarms have gotten better but assistant always used to say, "your alarm is set for 6am" and id see a thing on screen that would say 12 hrs and 22 mins until the alarm and it would disappear about 3 seconds after it created it. Gemini doesn't have that. And if I ask for directions, it reads off things for like 30 seconds and makes it hard to just click on map or what not (nonsense shit too, like reading off the GPS coordinate, pronouncing all the pronunciation). I didn't go out of my way to update it, it's just a cheap Motorola phone because I broke my old Pixel in Phoenix on a work trip. For a $70 phone though, it really does do all I need these days though. First phone I've had that I can use it for hours during the day and the battery doesn't die by the end of the day, because the CPU/GPU aren't good enough to require high power use, haha. Not sure how it works with other phones these days but I used this phone to map 2 locations (about 20 miles total), make a phone call (30 mins), check out Lemmy (1.5 hours waiting), then turned it off, and came back 37 hours later and turned it on and my battery was at 92%. Sometimes having a slow processor has perks I guess.
Image generators are reverse llms, tbf. Steve Mould has a good explanation of it.
well, ish. llms have a vector space of words, image generators of features. they use a second model to associate words with features. Steve's explanation is a great intro but for a deep dive i recommend Self-Cannibalizing AI from 37C3.
Right but as I said in the other thread as well, what do you think is handling the text part of text-to-image creation tools?