Dynamic RSS-based HTML Scraper.

This python code creates an HTML dynamically from TheMarker.com RSS feed in Hebrew.

Screenshot

Technologies Used: Python 2/3.X (feedparser, yattag, lxml, requests)

Description

No one loves ads. TheMarker.com is full of them. This is a Python-HTML-scraping that scrapes TheMarker, an Israeli news Website and creates a clean, new static HTML that retains only the articles' name, main image and content. This results in an easier to read, ad-free way of accessing the content on the website.

Installation

Install the python modules listed above, and simply run 'TheMarker.py'. Optionally, select the required feed to be generated; otherwise simply press enter. An HTML will be saved in the same directory as your RSS.py file, and automatically opened using your default browser.

Example
Example

Figure 1: Two articles generated by the script.

Back to my projects →