Calling all web Jedis... I need to build a bot.

edited August 2008 in advice
More specifically, I need to make a bot/crawler/thing that can check a predetermined set of RSS feeds every 12 hours, look through them for specified criteria, work out which content fits those criteria most closely, and then do some stuff with it.

1. How far in over my head am I? I don't know much beyond a little PHP at present.
2. What programming language can/should I use?
3. Should it run directly on my server or on a local machine and then send the data to my server for use on a site?
4. I've been drinking, sorry.
5. But I really do need answers if you have them.

Comments

  • edited 11:42PM
    6. Please. :happy:
  • edited 11:42PM
    if it's predetermined sites, it's easier

    you will need to set up a cron job on the server to kick off the PHP (or any other) script, so first find out whether your host lets you set one up. alternatively, have the script run whenever a page is accessed - whatever seems easiest.

    other than that it's not too tricky

    if you want to crawl the entire web, that's harder
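To make the cron-plus-script idea a bit more concrete, here is a rough sketch in Python (the reply says any language will do; the same shape works in PHP). It assumes plain RSS 2.0 feeds and treats "criteria" as a simple keyword list, which may not be what the original poster had in mind. The function names, the script path in the cron line, and the scoring rule are all made up for illustration.

```python
# Hypothetical crontab entry to run this every 12 hours (path is a placeholder):
#   0 */12 * * * /usr/bin/python3 /path/to/feedbot.py

import urllib.request
import xml.etree.ElementTree as ET


def fetch(url, timeout=30):
    """Download a feed and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def parse_items(rss_xml):
    """Return (title, description, link) tuples for each <item> in an RSS 2.0 feed."""
    root = ET.fromstring(rss_xml)
    items = []
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        description = item.findtext("description", default="")
        link = item.findtext("link", default="")
        items.append((title, description, link))
    return items


def score(text, keywords):
    """Count how many of the keywords appear in the text, ignoring case."""
    lowered = text.lower()
    return sum(1 for kw in keywords if kw.lower() in lowered)


def best_matches(rss_xml, keywords, top_n=3):
    """Return up to top_n (score, title, link) rows, highest score first.

    Items that match no keyword are dropped. This is an assumed ranking rule,
    not anything specified in the thread.
    """
    scored = []
    for title, description, link in parse_items(rss_xml):
        s = score(title + " " + description, keywords)
        if s > 0:
            scored.append((s, title, link))
    scored.sort(key=lambda row: row[0], reverse=True)
    return scored[:top_n]
```

A run would look something like `best_matches(fetch(feed_url), ["php", "cron"])` for each feed URL in your list, after which you store or publish the results however the site needs.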
  • edited 11:42PM
    have a quick search for "php scrape" and I'm sure things will start falling into place :)
  • edited 11:42PM
    Thanks to both of you. I am now hung over, but ready to learn. :smile: