Skip to content

Makyl/medium-scrape

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Synopsis

The problem statement was to find all links present on medium.com with the restriction that not more than 5 requests are made per second (so as to not spam their servers). It also has to generate the csv file with the link. The first command without async library generates links.csv while the one with async library generates asyncLinks.csv. It find all the links that point to medium.com only, not that link to other websites but enabling that won't be a major thing provided it doesn't needs to be scraped. There are two functions made, one that uses async library and one that doesn't.

Installation

  1. Clone the repository

  2. Run npm install

  3. After installation is complete, run the following commands

API Reference

Currently there are two functions which can be run by these commands:

  1. npm run startWithoutAsync

  2. npm run startWithAsync

License

NONE

About

An app to find all hyperlink on medium.com

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published