this post was submitted on 21 Jun 2023
105 points (100.0% liked)
Reddit Migration
21 readers
2 users here now
### About Community Tracking and helping #redditmigration to Kbin and the Fediverse. Say hello to the decentralized and open future. To see latest reeddit blackout info, see here: https://reddark.untone.uk/
founded 1 year ago
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I also made a similar bot, might be useful!, this one doesn't check every single post, only external links, I didn't want to repost original content or spam communities, so the post only gets one post per hour (which can be changed) and checks the community to see if there's a similar post there already (same link).
Edit: it uses the json page, and filters posts based on upvotes, downvotes, age, etc. It supports multiple subreddits and communities at the same time.
https://github.com/daniel-lxs/BotIt
Ah sweet I was about to start working on something like this. Will definitely use it in the future. Thanks
*edit: Ah it still uses the public json endpoints. A "real" scrapper might be necessary in the future.
You are right, I already had a scraper in progress, if it comes down to it I will use that instead.
Edit: also you can contribute if you want to, suggestions are 100% open.
I might if I find something I am missing.
I have some specific requirements when it comes to scrapping. I am planning to use it on my phone in termux which doesn't allow all nodejs modules. Sadly also the most popular scrappers which use a xserver in the background. So it has to be very basic html scrapping which should be possible when you do it on the old.reedit site.
I see, I used node bc that's what I'm most familiarized with. I'm running it from Digitalocean and seems to be stable for now.
But if you have different requirements you should definitely give making one a go.
Oh, nothing wrong with node at all. I used it for something very similar in the past myself. It's just that certain modules just don't work in termux.