[Done] New try at upgrading to 0.18.1 July 1st 20:00 CET
We'll give the upgrade new try tomorrow. I've had some good input from admins of other instances, which are also gonna help troubleshoot during/after the upgrade.
Also there are newer RC versions with fixed issues.
Be aware that might we need to rollback again, posts posted between the upgrade and the rollback will be lost.
We see a huge rise in new user signups (duh.. it's July 1st) which also stresses the server. Let's hope the improvements in 0.18.1 will also help with that.
PSA from Admin Team: The update completed roughly two hours ago. Since that time, the Admin team (and other site admins) have been working on the noted performance issues. We believe we have found a solution, but we still need time to test this out. You may still see brief outages and differences in performance as we are testing different configurations. We are trying to prevent rolling back.
While I know this can be frustrating - especially today - please keep in mind we have a team of volunteer techies (from around the globe!) collaborating on this issue. It is an inspiring situation. Also keep in mind that lemmy.world is quite a bit larger (and more active than any other instance). As such, we are a bit of a ‘test instance’ in regards to high volume requests. This is just part of the growing pain. We appreciate your understanding.
@ruud@lemmy.world will provide a debrief once we have completed testing.
Thanks for the kind words! Yeah, there are definite growing pains, and likely will be for some time (just do to the codebase we are working with, understandably). We have a really solid group though heading up lemmy.world though so we will be just fine ;)
Please try to clear the browser change
The new version is doing some of the API calls differently, which may cause issues with the old/cached version in your browser
The login form should be related to the overall spiking we were seeing, though I can't say we had conclusive evidence of that. I have been able to get it to load properly with a few refreshes. Maybe try clearing your cache? I had to do that several times during testing.
Performance is extremely uneven. Sometimes loads instantly, sometimes I get a timeout. Upvotes don't show up until reload. Still a lot better than the last attempt.
The server hamsters seem to be on fire, but I won't hold it against the service/community/individuals (except spez; fuck you spez). Lemmy is in the unfortunate position of being forced into a development model called "fuck it, we'll do it live!"
Heyo! Small update from someone who is watching the upgrade live; it's ongoing still. Seems like they are still facing some performance issues. So grab your popcorn and wait! 🍿 (Btw, i am not a sysadmin, just a moderator)
TFW you run a small server with an open source social platform for shits any giggles and one month it explodes a thousandfold because of one greedy pig boy.
I’ve been on here almost a month and there are usually very few issues and the admin team are quick to troubleshoot when an issue does arise. It should calm down soon!
Been here for a month~. Other than a few growing pains it's been pretty smooth. An initial learning curve on the whole instance/lemmy/fediverse thing and learning to sort by all/hot or active instead of just "local" to get ALL if the lemmy-verse instead of just your own instance. Once you learn how to use it it's a much better experience the reddit. Right now it's getting the reddit hug of death. But it will survive and flourish.
Just a heads-up. I ended up having to create a new login at lemm.ee because even after the improvements in speed and so on with the upgrade, for all intents and purposes it looks like my account here may have been wiped. Can't login from anywhere, getting "Record not found" whenever I try to sign in on Connect, "incorrect login" from Jerboa, and a spinning button and then nothing on a PC browser, basically treating my account as if it doesn't exist. Some folks appear to be able to login though. Is this an ongoing server issue, or did my account actually get wiped? Same username (Grangle1) as on lemm.ee.
Nice work with everything so far, really liking it. What all is being changed/fixed in the new revision? Is it just more efficient code to support higher site traffic?
Biggest thing is it entirely gets rid of the problem where the All feed would auto-update and add a ton of random posts while you were trying to browse. Really awful experience browsing by All, for a while there.
Just looked. First 1/2 loads were slow but after that it's lighting fast! I think by not everyone establishing a Websocket connection and just loading once performance should increase a tad bit.
You know, maybe I'm being overly dramatic, but I really feel like we're at the beginning of something special. I realize that Lemmy has existed for some years, but the whole Reddit thing is really pushing it into the limelight, and it's new to me (I joined just before the blackout).
Seeing all the developers and admins dealing with the inevitable bugs, working together to keep the federated instances operating together during a huge surge in users, like a flotilla of disparate ships lashed together facing a major storm - it makes me excited, nervous, and hopeful. I fully intend to support developers and instances financially, but also want to give a big "yay team" shout-out.
Not dramatic at all. This feels like starting fresh, and the enthusiasm and excitement makes it feel like the early 2010s of the internet again. I love it 😍
Time and knowledge are probably more important. A lot of well meaning people are copying over a bunch of Reddit content into communities here, but I think that's only useful in a small subset of them. If you're looking to scan through a bunch of memes, okay that's great. But a lot of people are looking for discussions and content to interact with, and a giant number of posts with no comments isn't helping that.
So starting or contributing to real discussions (like this one!) is a big contribution.
I think the solving the bugs and server issues is great to see, but I also think Lemmy is facing some deep issues about its design in the future, and it's going to take more than technical know-how to fix it.
Yeah, I agree. I think they can be worked out, but I don't think they have been yet. I think there's a lot of potential in the distributed architecture, and am excited to see it progress.
My lemmy server is also running the same versions are you are now. I was getting timeouts with lemmy.world federation so I recompiled lemmy_server with timeout changes:
diff --git a/crates/utils/src/lib.rs b/crates/utils/src/lib.rs
index e5d07db2c..e2c592d82 100644
--- a/crates/utils/src/lib.rs
+++ b/crates/utils/src/lib.rs
@@ -18,7 +18,7 @@ use std::time::Duration;
pub type ConnectionId = usize;
-pub const REQWEST_TIMEOUT: Duration = Duration::from_secs(10);
+pub const REQWEST_TIMEOUT: Duration = Duration::from_secs(13);
#[macro_export]
macro_rules! location_info {
diff --git a/src/lib.rs b/src/lib.rs
index cc77ca48f..45c621a7c 100644
--- a/src/lib.rs
+++ b/src/lib.rs
@@ -37,7 +37,7 @@ use tracing_subscriber::{filter::Targets, layer::SubscriberExt, Layer, Registry}
use url::Url;
/// Max timeout for http requests
-pub(crate) const REQWEST_TIMEOUT: Duration = Duration::from_secs(10);
+pub(crate) const REQWEST_TIMEOUT: Duration = Duration::from_secs(16);
/// Placing the main function in lib.rs allows other crates to import it and embed Lemmy
pub async fn start_lemmy_server() -> Result<(), LemmyError> {
Tried to logout, website on mobile won't release my session, and when I tried to login on jerboa it said password incorrect, not sure if that's a ui issue or a lemmy.world issue though
The progressive web app had ui issues after the update so I uninstalled it and now I can't figure out how to reinstall it 😕 tried wefwef but that doesn't load anything and can't log in. Happily staying patient though, hoping things smooth out!
I used to work with a guy who was a sysadmin for a moderately sized webhost in the mid-2000s when things like containers and cluster orchestrators didn't exist and high availability/multi-master database systems were only really accessible to banks.
He once described patching the servers "like trying to change all the tires on a car without being allowed to pull over".
This was how it went with the upgrade on lemm.ee as well, was very slow at first but improved as they fixed things up and all the weird caching stuff sorted itself out.
Oh, do you mind to write something regarding these upgrades, when you'll have time? I think lemmy.world could be one of the highest load rust apps :) Interesting how tokio and axum will show itself.
I sadly couldn't use the web version of lemmy.world at all cause of the websocket issues. 0.18+ is gonna be an amazing upgrade fore this instance! All the devs and staff who made this happen have my undying respect.
Good luck. This is my favorite instance so I will be eagerly awaiting. Lemmy has been incredible and strangely what I needed. I would doom scroll reddit and see the same posts and the same comment chains and just redundancy hell.
Lemmy feels so different and the community is so welcoming. Thanks again everyone for making this lost Redditor a found lemming (is that the correct term lol).
It's a bit of a lose lose situation tbh because if we don't upgrade then the first experience those new people are gonna have of Lemmy is an unusable All feed and a broken Jerboa.
My other account is on lemm.ee and the admin there contributes a lot of fixes to the software (plus has been working with Ruud to figure out our federation issues). He upgraded us a couple days ago and it's been amazingly stable given that it was only a day after the rc1 disaster.
Ultimately I trust that he and Ruud know what they're doing, but sometimes with an untestable situation like the size of .world you just have to pull the trigger and hope it works.
Liftoff is working great. Jerboa should be held off on for now because it can't handle different server versions. Different server versions WILL always be an issue in a federated environment.
Probably not, but surprisingly some 3P apps still work. I can confirm that Joey for Reddit and Infinity for Reddit are still able to access reddit. Seems like they don't pull the plug in one go? Or cherry picking the most demanding apps first?
For context, it's July 1st 16:27 p.m., GMT+9 where I am.
There might be less users influx than expected, though we'll see in the next few hours or days.
All of the apps will work until the devs pull the plug. Maybe those devs have decided to try to eat the cost and offset with subscriptions or something. I know the Apollo dev said it would cost him twenty million dollars per month to pay the API fees, so hopefully the devs of those other apps don't go bankrupt overnight.
A lot of improvements, but some weird problems. Jerboa finally allowed me to log in but not much else. Connect is working better, but won't allow me to reply to one particular post ("incorrect language" or something). The browser version and wefwef are slow and glitchy - my comment about birds ended up in a conversation about Lemmy apps. I don't know anything about tech stuff but I get the impression that people are working very hard to fix all the issues. And the spezidus wouldn't be making things easy. Looking forward to a bigger better Lemmy!
There are currently some issues with wefwef
With the upgrade also came some API & Backend changes. Wefwef is working on adapting to it, as far as I know.
Latest version of Jerboa on Google Play doesn't work with servers below 0.18;
For those who want to use Jerboa to access lemmy.world before the upgrade to 0.18.x happens, you can install a specific release from GitHub instead of Google Play. I found version 0.0.34-alpha to work (using it rn to post this). https://github.com/dessalines/jerboa/releases/tag/0.0.34
Edit: since the update is done, it's probably best to go back to latest version. Glad I helped someone!
Same here. I tried going to a link to another instance (maybe) and then subscribing and saw I wasn't logged in. I tried logging in and it just hung up or didn't respond. Also using Liftoff.
Hey thank you! I really thought the upgrade has started since I’m having difficulty fetching posts and commenting.
I can’t believe you still took the time to reply to comments as busy as you are. Lemmy is really like the early days of Reddit. I hope it never change.
I get the feeling that it's traffic. This instance has grown much larger, stupidly fast, over the last two weeks due to Reddexit (myself included, having arrived less than two weeks ago). I'm sure the admins are working to find ways to handle it.
I'm just going to exercise patience. Good things are worth waiting for!
I hope for the sake of the exodus that the servers can cope with the pressure. Lemmy.world is shaping up to be somewhat of a primary instance and having it stable would be crucial for any sort of migration to stick.
Cleared my cache and cookies as I thought it would be necessary like last time, but now I can't log in. The login button just turns into the animated wheel and then nothing happens. I'm typing this on a mobile where I still have an active session.
I think it's still struggling with the high user influx from Reddit and the 3rd party apps which stopped working today. And probably after the upgrade caches have to be rebuilt. Hope it improves soon.
A bit concerning perhaps hold this off for a week or so seems like this is very important moment for the users from r**dit to decide if they want to be here or not.
That's why it has to be done today. At the moment, Jerboa instantly crashes when trying to access Lemmy, which will definitely scare away new users. My understanding is that this is because Lemmy.World is on version 17, but Jerboa requires instances to be on version 18 or higher. If successful, I believe this would fix the instant crash issue, so we'll at least have an Android app working again.
Hopefully, these are just growing pains symptomatic of a site trying to deal with rapid growth and rapid improvements.
The difference between .17 and .18 is pretty substantial. Lemmy.world neglected to update to .18 because captcha support was not working for new account signups, so they waited for v0.18.1
There should be substantial performance improvements because it moves Lemmy from using websocket to HTTP API.
Websockets largely have a lower compute cost per request, HTTP requests are slow and expensive vs just firing off data in an already established TCP connection, so this isn't tracking for me?
Was it just the overhead of managing the websockets? Shouldn't an API gateway be doing that anyways...?
Hey man if y'all have planning meetings or anything for this stuff I'm willing to jump on and read the source code. 15 years engineering and dev, willing to put time in for you guys.