Web scraping from a directory
Project details
Hi, we wish to have a dataset comprised of data from a well-known directory in the UK (which will be detailed when we award the job).
The website is a typical directory, where you search for a service and location and it lists the various results. The results page consists of approximately 25 results. Each result has a minimum amount of information which is consistent in all results, some have more information. We wish to capture as much as possible.
There will be two stages of web scraping:
– Stage 1: preliminary details from the main results page
– Stage 2: using the url’s generated from the above, we wish to then find the relevant contact details by going to each site and searching for phone number/email address and any relevant pictures. ** We will discuss Stage 2 in greater detail after Stage 1 has been successful. This project only refers to Stage 1.**
There are approximately 10k results in Stage 1.
You will need to employ IP rotation and headers.
The results from Stage 1 will need to be in a CSV file, with clear and easily understood headings. We will pay milestones at every 2.5k results sent to us.
Awarded to:

Md Asif H.
(4.8)
Awarded to:

Md Asif H.
(4.8)
Project details
The website is a typical directory, where you search for a service and location and it lists the various results. The results page consists of approximately 25 results. Each result has a minimum amount of information which is consistent in all results, some have more information. We wish to capture as much as possible.
There will be two stages of web scraping:
– Stage 1: preliminary details from the main results page
– Stage 2: using the url’s generated from the above, we wish to then find the relevant contact details by going to each site and searching for phone number/email address and any relevant pictures. ** We will discuss Stage 2 in greater detail after Stage 1 has been successful. This project only refers to Stage 1.**
There are approximately 10k results in Stage 1.
You will need to employ IP rotation and headers.
The results from Stage 1 will need to be in a CSV file, with clear and easily understood headings. We will pay milestones at every 2.5k results sent to us.
skills of Md Asif H.
skill ? | level ? | projects ? | ||
---|---|---|---|---|
Web Scraping |
70%
|
385 | ||
Data Mining |
57%
|
171 | ||
Python |
46%
|
171 | ||
Data Warehousing |
45%
|
1 | ||
Data Visualization |
45%
|
1 | ||
Web Search |
44%
|
128 | ||
Excel |
43%
|
214 | ||
Data Analytics |
43%
|
1 | ||
Data Science |
41%
|
1 | ||
Data Analysis |
41%
|
1 | ||
Data Entry |
40%
|
171 | ||
Software Architecture |
38%
|
86 | ||
Data Processing |
37%
|
43 | ||
PHP |
37%
|
128 | ||
Website Design |
35%
|
43 | ||
HTML |
35%
|
43 | ||
Software Development |
35%
|
1 | ||
XHTML |
35%
|
1 | ||
Web Development |
35%
|
1 | ||
Landing Pages |
35%
|
1 | ||
HTML5 |
34%
|
1 | ||
Website Build |
34%
|
1 |
Some Md Asif H. projects
Project Title | Skills required in the project |
---|---|
Scrap data details from dashboard | Python Web Scraping |
Extract emails ~5.000 very fast! Cheapest offer wins | Data Entry Data Mining Excel Web Scraping Web Search |
Simple Web data scrapping | Data Mining PHP Python Software Architecture Web Scraping |
Data Entry – Input Webpage Data | Data Entry Data Processing Excel Web Scraping Web Search |
Collect data from Web | Data Entry Excel PHP Python Web Scraping |
scrape directory info — 2 | Python Web Scraping |
Scrape this site | Data Entry Data Mining Excel Web Scraping Web Search |
Scraping a website | HTML PHP Software Architecture Web Scraping Website Design |
Web scraping into excel | Data Mining Excel Web Scraping |