Please see detailed requirements for project details. :-)
## Deliverables
I need content scraping from various article websites online and posted to my central wordpress and two other paltform blogs. I have Hostgator hosting account to work this on. Other platforms apart from the wordpress that I would like the bot to support are blog engine, movatype and blogger or if these are not possible any others you suggest as long as we have three platforms to work on! But out of them you decide which ones are technically the best for this. The article websites to scrape from that I have gathered for this process are:
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
The bot either has to be server driven or run as a desktop application on my home PC. The process in which the bot will work is:
1) Bot takes a fixed amount of categories from the chosen article website that it's going to scrape content from and sets those categories up on our blog sites.
2)Bots goes to chosen articles website and selects 5-10 random articles each day in a category. We need to make chosen articles are not duplicated.
3) The bot takes those articles and spins it via a online spinning website ([login to view URL]) or a desktop spinning software I have installed on my desktop! If we are using the online solution then we will need to in coorperate the use of public proxies which I get a fresh batch daily.
4) The bot then posts the spun article with a outbound link to a website that's related to the main keyword of the article, the link is posted within the article as a keyword anchor text.
And that's its in simple terms I have also gathered a list of website that I think are already running software similar to what I am proposing so you can have a look how their layouts and articles postings and platforms if you want. I have listed them below:
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
[login to view URL]
Other information:
Comments on our blog should be disabled so no user interaction at all. The bot if run on the server should run automatically on my domains 24/7 per day every day! Of decided to be desktop driven then scheduled through Windows 7 or similar when the PC is on.
Also the blog roll on some of the website is out linking to some high PR sites so maybe we can have this to.