Which is better for long-term, production-level data extraction on OnlyFans - API Sniffing or Webscraping?

I am heavily torn between these two methodologies in building out an in-house data extraction pipeline for an OnlyFans-related SaaS.

The system needs to extract the currently logged in user's data, such as their chats, sales, etc... (all with their explicit permission, it is what the app does). Everything is occurring in OnlyFans' website, and I am not yet aware of how they handle this entire dynamic - Are they strict? Is one method much more prone to detection/account banning than the other?

API Sniffing is obviously much more performant, and I would absolutely love to use it as the method when it comes down to being able to scale and handle more data, programmatically so much cleaner, resulting in lower user wait times, but I am very worried the result of its possible detection will be much more swift and brutal than going the scraping route.

Webscraping, being the more common approach, seems to have more possible approaches and defenses, and unsurprisingly much less performant in my tests.

This data extraction is the core of the SaaS, so the method being as scalable, stable, and cost-effective as possible is extremely important in this decision.

I plan on using robust proxy/ip rotating, etc... alongside both methods.

Would love any thoughts or perspective.

submitted by /u/frothymonk
[link] [comments]

from hacking: security in practice https://ift.tt/768mCcw

Comments