Hi
We are looking for someone to scrape a list of 20 e-commerce websites.
The project will be split into two parts:
1st part: Scraping
The requirements are:
get the list of products
get the product url / name / description / color (if available) / price / categories and sub-categories
get all the images linked to each product and download them in HD
for each image, save the url and position (1, 2, 3 ...) on the product page
The expected results for each site are:
an archive with all the HD images downloaded
a CSV/JSON file containing the list of products with their associated information and images
- different colors need to be treated as different products because they have different sets of images (Pimkie has a different URL for each color; you cannot see them from the product page, but you can see them from the listing pages)
- to generalize the data across websites, there is only one category field, which should be an array containing more than what you selected here. For Pimkie, I would put the whole breadcrumb as categories (except the first and last items), which would mean: ["Vêtements", "PRINTEMPS/ETE", "Jean", "jean mom"]
- images need to be in a higher resolution (around 3000x3000, or at least 2000x2000), so you need to either find the "zoomed" version or, as in this case (which happens quite often), use the size specified in the URL: you can simply replace "sw=760&sh=938" with "sw=3000&sh=3000"
- images need to be downloaded
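The two notes above about image size and categories can be sketched in Python as follows. This is only an illustration under assumptions: the function names, the example.com URL, and the first and last breadcrumb items ("Accueil", "Jean mom bleu") are hypothetical, and other sites may encode the image size differently from the "sw"/"sh" query parameters.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def upscale_image_url(url, size=3000):
    """Return the URL with the 'sw' and 'sh' query parameters set to `size`.

    Assumes the site encodes width/height as 'sw' and 'sh', as in the
    example above. URLs without these parameters keep their query as is.
    """
    parts = urlsplit(url)
    query = parse_qsl(parts.query, keep_blank_values=True)
    new_query = [
        (key, str(size)) if key in ("sw", "sh") else (key, value)
        for key, value in query
    ]
    return urlunsplit(parts._replace(query=urlencode(new_query)))


def categories_from_breadcrumb(breadcrumb):
    """Keep every breadcrumb item except the first and the last one."""
    return [item.strip() for item in breadcrumb[1:-1]]


# Hypothetical inputs, for illustration only.
hd_url = upscale_image_url("https://example.com/img.jpg?sw=760&sh=938")
categories = categories_from_breadcrumb(
    ["Accueil", "Vêtements", "PRINTEMPS/ETE", "Jean", "jean mom", "Jean mom bleu"]
)
```

Requesting a larger size does not guarantee that the server returns a true high-resolution image, so the dimensions of each downloaded file should still be checked.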
2nd part: Image identification
The requirements are:
identify the "flat lay" images (only the front ones)
identify the "on model" images
identify the matching "flat lay" and "on model" pairs (same product, same color)
The expected results for each site are:
a csv/json list of identified images and whether they are "flat lay" or "on model" (ignore the others)
a csv/json list of matching "flat lay" and "on model" image pairs
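The pairing step could look roughly like the Python sketch below. It assumes each image has already been labeled "flat lay", "on model", or something else by a separate classification step that is not shown here. The field names (product_url, color, image_url, label) and the function name are hypothetical, not a required schema.

```python
from collections import defaultdict
from itertools import product


def match_pairs(images):
    """Pair every "flat lay" image with every "on model" image that
    shares the same product URL and color. Other labels are ignored."""
    groups = defaultdict(lambda: {"flat lay": [], "on model": []})
    for img in images:
        if img["label"] in ("flat lay", "on model"):
            key = (img["product_url"], img["color"])
            groups[key][img["label"]].append(img["image_url"])

    pairs = []
    for (product_url, color), group in groups.items():
        # Cartesian product: each flat lay is matched with each on-model shot.
        for flat, model in product(group["flat lay"], group["on model"]):
            pairs.append(
                {
                    "product_url": product_url,
                    "color": color,
                    "flat_lay": flat,
                    "on_model": model,
                }
            )
    return pairs
```

The resulting list of dictionaries can be written out with the standard json or csv modules to produce the pairs file.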
Kindly refer to the attachment as well.
Please send one completed, accurately scraped listing (from [login to view URL]) so that I can confirm you have understood the project.
Hi, I am experienced in web scraping, software architecture, data mining, etc. I can start right now, but I have a few questions. Let's have a quick chat and get started. Waiting for your reply.
Hi, I am a Python developer with hands-on experience in web scraping. I have scraped almost 50 sites so far, parsing and cleaning each one and saving all the information, images, and URLs in a database.
I hope the above meets the needs of your project. Let's have a chat and start the project.
Thank you.