Skip Navigation

POLL: Should we allow content migration bots?

Bots of this type have appeared recently, and people are asking if it's okay to use them. I'm not sure about this either, so I think it would make sense to ask users.

These bots follow some subreddits on Reddit and automatically post it to Lemmy when a post is created there.

I've seen an example site for it: lemmit.online. This instance is dedicated solely to mirroring Reddit posts to the Lemmy instance.

Maybe instead of mirroring to a community on Lemmy NSFW, we can subscribe to lemmit.online via Lemmy NSFW. This way we could have kept Lemmy NSFW free of bots. Currently, even if accepted, I believe it should be done under admin control to prevent duplicates.

Here is the poll: https://strawpoll.com/polls/PbZqRw82byN

I'm open to suggestions.

95 comments
  • I don't like reposts in general and certainly not automated bots.

    Lemmit is certainly useful for us who don't want to visit Reddit but don't want to miss on certain content, but that's what that instance is for. So keep the bots to instances like that, and the rest of Lemmy as free of bots as possible.

  • As others have said, I think it makes sense to have the top 40% of posts of all time, or something like that, reposted. As a mod for several subs here, those bots would help a lot when seeding content, and could even help me crank out posts of new, relevant content in the future.

    I don't think it's unreasonable for the amount of posts a bot can make a day to be limited. Or maybe a time frame that we allow content migration en mass like x date to x date, theb bot posting is limited.

  • For what it's worth, as the creator of lemmit.online, I totally understand people not wanting to see automated content bots. I, for one, wouldn't want to see them mixed with regular content. That is why I made sure to put it on its own instance, and not allow any users - so there would be minimal harm if a server would decide to defederate.

    And yes, NSFW content is allowed on my server :). For more answers, see this FAQ post. For more questions, please post them in that comment thread there.

    • Hi. Can you reach me via matrix? We can do a collaboration. My address is @xaeg:matrix.org

  • Thanks for the notification of this post @yay@lemmynsfw.com.

    I have created a script that would take the top (user-selectable) 0-1000 posts of a subreddit and post them to a Lemmy community. My plan was then to implement a vote threshold so that posts older than 48 hours and above a user-defined karma limit would be pulled in each time it was run - however the account login no longer works so I assume it and its posts were purged, so I'm here instead!

    I do think that in order to get people engaged, we need content to draw them in. I noticed that once I'd posted 50 items across I immediately started getting subscribers to the community.

    What I don't think is right is using bots to just replicate all the content on Reddit. As a moderator of several subs, a lot of content gets removed through moderation (hence the 48 hour limit), and a lot of junk gets through but just doesn't get upvoted (resulting in the karma threshold). Avoiding the "rubbish" would be good.

    My view is that using bots/scripts to seed communities means we can kick start them into life much more quickly, and then when a critical mass of users is reached they become irrelevant and can be disabled. I don't think we're here to just copy and paste from Reddit - otherwise surely you'd just go there instead.

    Edit: Just to comment on the poll itself. I don't think "bot only" communities make sense - we're not here to just copy Reddit... lemmit.online can do that. I believe we should allow bots to seed, and then let actual users take over. Unfortunately there isn't an explicit option for that so I just went with "Yes".

  • Sorry i wanted to add one more thing: it would be really nice if there was a way to do this without flooding new... as i think the people doing this dont really want to impact other communities (I just realized this is an issue, as i started porting content before realizing)

    One way we could try to minimize impact ( which is what im going to do for now) is ask anyone porting content to do it in off hours (like midnight EST would cover all of US and most of europe ?)

  • I think content import should be a thing across Lemmy, most users moving over have tons of content they've posted on Reddit, and having an easy way to bring that here would be great. But Lemmy isn't really built to handle bulk imports yet, if you simply hit the API of your instance it will flood /New on every instance that's indexed whatever sublemmy is being imported, and it will severely disrupt the use of the sub for a while. If content could be backfilled directly to the database with earlier timestamps it could be done smoothly, though.

  • I the past few days have been working on a quick c# app to simply pull the last 24 hour posts on a subreddit, and allow me to click a button to upload them individually here to kickstart a community to replace the subreddit. I think like what alot of people are saying these tools can help seed communities and boost engagement. i think in the long term we should start to block these tools, but i dont think the time is yet. This is still a new group, and while some communities on this instance are sizable, others are way to small to be sustainable yet :/

    Also If we could hopefully prevent the massive amount of content on reddit from disapearing, and give a place to preserve that that would definitly be amazing. although thats easy for me to say as the one who isnt hosting the files :/ (although for some communities the actual data for that isnt that significant, and since lemmy doesnt support galleries yet, galleries are still hosted offsite)

    Anyway thanks so much for the work you all are doing, I appreciate that you all went out of your way to enable communities and users to migrate off of reddit and hopefully to help form a better platform.

    • Hmm I was thinking of making the same. Some communities are just dead till they get bootstrapped with some little spam and then ppl feel encouraged to participate.

      • Someone else here had a smarter idea honestly of haveing a 48 hour buffer and using the reddit apis up vote ratio, and total karma to filter out alot of the spam. Honestly when I built this I had missed that part of the api, so I'm probably going to rework it in a bit

  • I voted for bot-specific communities, but I honestly think subscribing to them on other instances is better. It's what I've been doing already.

    Also, that other comment's idea about banning anything but OC, that's the worst idea I've ever read. At that point, you may as well just declare this an anti-2D instance, because adult art communities will never survive under such draconian rules. Even on Reddit, my sub only got about one OC post a month, and I doubt that one regular user has moved here.

  • I voted no, although I nearly voted for the "only in certain comms" option. I've got the beginnings of a tool that will make it easy for people to select content from the reddit backups, but I didn't intend to make it automatic, just something that helped people find content.

95 comments