@SuitedUpDev as long as you don't crawl Fedi domains.
People here don't like that kinda thing. 🤷‍♂️
A quick question for my fellow #PHP developers here. Does anybody have a suggestion for a spider/crawler library in PHP?
I wanna crawl some domains under a certain TLD and keep track of how many "outgoing" links are referenced on those domains.
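For the link-counting part of the question, a rough sketch using only PHP's built-in DOM extension might look like the following. It is not a recommendation of any particular crawler library. The function name `countOutgoingLinks` is made up for illustration, and it assumes "outgoing" means any `<a href>` whose host differs from the host of the page it was found on (so `www.example.com` and `example.com` would count as different hosts). Fetching the pages, rate limiting, and honoring robots.txt are left out.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper: count links in $html that point to a host other
 * than the host of $pageUrl. Returns [external host => number of links],
 * sorted by count, highest first.
 */
function countOutgoingLinks(string $html, string $pageUrl): array
{
    // DOMDocument::loadHTML() rejects an empty string, so bail out early.
    if (trim($html) === '') {
        return [];
    }

    $pageHost = strtolower((string) parse_url($pageUrl, PHP_URL_HOST));

    $doc = new DOMDocument();
    // Real-world HTML is rarely valid; collect parser warnings silently.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $counts = [];
    foreach ($doc->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        $host = parse_url($href, PHP_URL_HOST);

        // Relative links, fragments, mailto: and malformed URLs have no host.
        if (!is_string($host) || $host === '') {
            continue;
        }

        $host = strtolower($host);
        if ($host === $pageHost) {
            continue; // Same host as the page: an internal link.
        }

        $counts[$host] = ($counts[$host] ?? 0) + 1;
    }

    arsort($counts);
    return $counts;
}
```

Because the function only takes an HTML string and the page URL, it can sit behind whatever fetching layer ends up being used, and the per-page results can be summed per domain afterwards.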
@BeAware I'm actually planning to crawl North Korean websites.
@BeAware I have a short list of domains that are reachable from the "regular" internet and I want to do some research on the data I can gather from their websites.
@BeAware Thanks a lot 🙏 I already have some experience with web scraping (from work) but not on this deep of a level.
GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.
All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.