ASPit - Totally ASP JSit - Totally JavaScript
Search PHPit

Use this textbox to search for articles on PHPit. Seperate keywords with a space.

Advertisements

Screen scraping your way into RSS

(Page 1 out of 3)

RSS is one the hottest technologies at the moment, and even big web publishers (such as the New York Times) are getting into RSS as well. However, there are still a lot of websites that do not have RSS feeds.

If you still want to be able to check those websites in your favourite aggregator, you need to create your own RSS feed for those websites. This can be done automatically with PHP, using a method called screen scrapping. Screen scrapping is usually frowned upon, as it�s mostly used to steal content from other websites.

I personally believe that in this case, to automatically generate a RSS feed, screen scrapping is not a bad thing. Now, on to the code!

Next: The code »



4 Responses to “Screen scraping your way into RSS”

  1. What the? Says:

    When trying to view the code:

    Security Alert! Breach has been logged for review (IP Address: xxx.xxx.xxx.xxx).

  2. Robert M Says:

    I’m getting the same in Firefox.

    Security Alert! Breach has been logged for review (IP Address: xxx.xxx.xxx.xxx).

  3. Jim M Says:

    Viewing the code in Internet Explorer 6 works fine. Thanks for posting this example! :)

  4. rigorouslogik Says:

    The code works great!
    How could I scrape just meta data or links ect into a text file?
    Thanks for any input.

Leave a Reply

About the author
Dennis Pallett is the main contributor to PHPit. He owns several websites, including ASPit and Chill2Music. He is currently still studying.
Article Index
  1. Introduction
  2. The code
  3. Conclusion
Bookmark Article
Download Article
PDF
Download this article as a PDF file