Will Meta scrape and crawl through all our data now?

So from my understanding, I can block an instance which prevents it from showing in my feed.

However, if the instance I post to (.world) is not blocked on the receiving instance's end (Meta), will they still get my posts unless the two are defederated?

If so, doesn't that open up the idea that Meta will be able to scrape and take ALL the data from ALL the (still federated) instances' posts that are not blocked by the Meta instance(s)? How can I protect my information from Meta while still being federated, or is that not possible?

23 comments
  • They always could? These are public-facing platforms. You’re being scraped by far more than just Meta.

    • Yeah, absolutely nothing was preventing them from doing so already, without launching Threads.

      Blocking Meta / Threads instances isn't going to stop them, either.

  • Scraping can only get a small amount of public information.

    Servers keep a lot of information contained, such as the IP address you posted from. Other info, such as your email address, also stays within your own instance. Meta cannot get at that information, and neither can any other Fediverse server.

    See this blog post from Gargron (Eugen Rochko), the creator of Mastodon. ActivityPub, the protocol it uses, underpins all these Fediverse systems, including Mastodon, Calckey, Pixelfed, kbin, Lemmy, etc.

    https://blog.joinmastodon.org/2023/07/what-to-know-about-threads/

    • What I wonder is how Lemmy handles this. He is writing about how Mastodon does things, not Lemmy.

  • If you post something public, people can access it. Corporations can access it. It's one of the reasons I ditched all my social media that identified me directly. They can scrape my stuff, sure; but they won't be able to link it to my actual name, face or existence in real life.
