Skip Navigation

How to keep bots and AI slop off lemmy?

I have noticed that lemmy so far does not have a lot of fake accounts from bots and AI slop at least from what I can tell. I am wondering how the heck do we keep this community free of that kind of stuff as continuous waves of redditors land here and the platform grows.

EDIT a potential solution:

I have an idea where people can flag a post or a user as a bot and if it's found out to be a bot the moderators could have some tool where the bot is essentially shadow banned into an inbox that just gets dumped occasionally. I am thinking this because then people creating the bots might not realize their bot has been banned and try and create replacement bots. This could effectively reduce the amount of bots without bot creators realizing it or know if their bots have been blocked or not. The one thing that would also be needed is a way to request being un-bannned if they get hit as a false positive. these would have to be built into lemmy's moderation tools and I don't know if any of that exists currently.

53 comments
  • Re: bots

    If feasible, I think the best option would be an instance that functions similarly to how Reddit’s now defunct r/BotDefense operated and instances which want to filter out bots would federate with that. Essentially, if there is an account that is suspect of being a bot, users could submit that account to this bot defense server and an automated system would flag obvious bots whereas less obvious bots would have to be inspected manually by informed admins/mods of the server. This flagging would signal to the federated servers to ban these suspect/confirmed bot accounts. Edit 1: This instance would also be able to flag when a particular server is being overrun by bots and advise other servers to temporarily defederate.

    If you are hosting a Lemmy instance, I suggest requiring new accounts to provide an email address and pass a captcha. I’m not informed enough with the security side of things to suggest more, but https://lemmy.world/c/selfhosted or the admins of large instances may be able to provide more insight for security.

    Edit 2: If possible, an improved search function for Lemmy, or cross-media content in general, would be helpful. Since this medium still has a relatively small userbase, most bot and spam content is lifted from other sites. Being able to track where bots’ content is coming from is extremely helpful to conclude that there is no human curating their posts. This is why I’m wary of seemingly real users on Lemmy who do binge spam memes or other non-OC. Being able to search for a string of text, search for image sources/matching images, being able to search for strings of text within an image, and being able to find original texts that a bot has rephrased are on my wishlist.

    Re: AI content

    AFAIK, the best option is just to have instance/community rules against it if you’re concerned about it.

    The best defense against both is education and critical examination of what you see online.

    • If you are hosting a Lemmy instance, I suggest requiring new accounts to provide an email address and pass a captcha

      Those are easy to bypass (or a human can spin up a bunch with throwaway emails and plug them into bots). I recommend enabling registration applications. While not foolproof, it gives the admins eyes on every new account. Also, consider denying any application that uses a throwaway email service.

    • If you are hosting a Lemmy instance, I suggest requiring new accounts to provide an email address and pass a captcha.

      The captchas are ridiculously ineffective and anyone can get dummy emails. Registration applications is the only way to go.

      • Plenty of websites filter out dummy email generators, could do the same in addition to applications. Making a drawing of something specific, but random (think of a list of a dozen or two images gen-ai gets wrong) could be a captcha replacement.

  • I was going to make a joke by posting an obviously stupid and AI generated answer, but I'm genuinely surprised by the insightfulness of Gemini's answer:

    Dealing with bots and AI-generated "slop" (low-quality, repetitive, or irrelevant content) is an ongoing challenge for any online platform, including Lemmy. Here's a breakdown of strategies and considerations: Challenges:

    • Federation Complexity:
      • Lemmy's federated nature means content comes from various instances, making centralized control difficult.
      • Moderation efforts need to be distributed, which can lead to inconsistencies.
    • AI Advancement:
      • AI-generated content is becoming increasingly sophisticated, making it harder to distinguish from human-created posts.
      • Bots can mimic human behavior, evading simple detection methods.
    • Balancing Freedom and Control:
      • Lemmy values user autonomy and free speech, which can conflict with aggressive moderation.
      • Overly strict measures might stifle legitimate contributions. Strategies:
    • Instance-Level Moderation:
      • Individual Lemmy instances play a crucial role in setting their own moderation policies.
      • Instance administrators can implement tools and rules to filter out unwanted content.
      • Blocking known bot-infested instances is a common practice.
    • Community-Driven Moderation:
      • Empowering users to report suspicious activity and content.
      • Implementing robust reporting mechanisms and clear guidelines.
      • Utilizing community moderators to review and take action on reports.
    • Technical Solutions:
      • Developing algorithms to detect patterns of bot behavior and AI-generated content.
      • Implementing CAPTCHAs or other verification methods to prevent automated accounts.
      • Enhancing Lemmy's software to provide better moderation tools.
    • User Filtering:
      • Lemmy itself has user level settings that allow for the hiding of bots. As shown in the search results, there are settings that can hide bot activity.
    • Community Education:
      • Raising awareness about the dangers of bots and AI slop.
      • Educating users on how to identify and report suspicious content.
    • Development of Lemmy Software:
      • As shown in the github issue, there is development discussion regarding excluding bot comments from sorting algorithms. This type of software development will be key to combating bot influence on the platform. Key Considerations:
    • The fight against bots and AI slop is an ongoing process.
    • A multi-faceted approach is necessary, combining technical solutions, community involvement, and responsible moderation.
    • Maintaining a balance between control and freedom is essential for the long-term health of the Lemmy network.
53 comments