The rep file: bbcnews.rep
The source file: bbcnews-src.htm
The result file: bbcnews-res.htm
(The two HTML files above need an internet connection to display their images.)
The screenshot of the source file
The screenshot of the result file
Open a few BBC News pages and save them. Then start Easy Replace, press the "action" button, and choose bbcnews.rep from the "example" subfolder of the folder where the program is installed. You can replace any or all of the saved pages. Click preview, or open the results manually in a browser, and you will see the result: the freshly downloaded BBC News pages have become pure news pages, your news pages, clean and tidy. If you can modify the rep file, you can get even more exciting results.
Here is the source of the rep file:
/// 1 handle top
rep ---$$begin--- to ---<table class="storycontent" cellpadding="0" cellspacing="0">--- with ---<html>
<head>
<meta http-equiv="content-type" content="text/html;charset=iso-8859-1" />
</head>
<body>
<div id=page>
<table class="storycontent" cellpadding="0" cellspacing="0">---;

/// 2 handle bottom
rep ---<!-- E BO -->--- to ---$$end--- with ---<p id=author>Cleaned BBC News<br>
An Easy Replace Example<br>
by Wonder Stuido</p></div></body></html>---;

/// 3 impose new style sheet
rep ---</head>--- with ---
<style>
body{padding:2em;}
body,p{font:normal 11px verdana;line-height:140%;}
.cap{font-size:10px;margin:0 0 .5em 2em}
#page { width:600px;}
#title{color:#967;font-weight:bold;}
h1,.artinfo {text-align:center}
#author{font:normal 11px verdana;color:gray; text-align:right;margin-right:2em;}
img{margin:1em 1em .5em 1em}
</style>
</head>
---;
All BBC News pages share the same file structure. After studying the HTML code of two or three BBC News pages, we can easily find the boundaries of the real news content, so we can simply delete all the code outside those boundaries. To improve the appearance and add a signature, I added some HTML code in the "replace with" expressions. You can also add whatever you like. As for the style sheet, you can modify it as you wish. If you have many pages, you can modify the "replace with" expression of the first rep command to add an external style sheet link, and comment out the third command.
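To make the logic of the three rep commands concrete, here is a minimal Python sketch of the same idea. It is only an illustration, not how Easy Replace works internally. The function name clean_page and the shortened header, signature, and style sheet texts are made up for this example. It also assumes that a "to" range includes both of its markers, which the first rep command suggests because its replacement text repeats the table tag.

```python
# Illustrative sketch only: mimics the effect of the three rep commands.
# All names below are hypothetical and not part of Easy Replace.

START = '<table class="storycontent" cellpadding="0" cellspacing="0">'
END = '<!-- E BO -->'

# Shortened stand-ins for the "replace with" expressions in bbcnews.rep.
HEAD = (
    '<html>\n<head>\n'
    '<meta http-equiv="content-type" content="text/html;charset=iso-8859-1" />\n'
    '</head>\n<body>\n<div id=page>\n'
)
FOOT = '<p id=author>Cleaned page</p></div></body></html>'
STYLE = '<style>\nbody{padding:2em;}\n#page{width:600px;}\n</style>\n'


def clean_page(html: str) -> str:
    """Keep only the text between START and END, then wrap and style it."""
    start = html.find(START)
    if start == -1:
        return html  # start marker missing: leave the page untouched
    end = html.find(END, start)
    if end == -1:
        return html  # end marker missing: leave the page untouched

    # Commands 1 and 2: drop everything outside the boundaries.
    # START is kept (the rep file re-inserts it); END is dropped.
    story = html[start:end]
    page = HEAD + story + FOOT

    # Command 3: put the style sheet in front of the only remaining </head>.
    return page.replace('</head>', STYLE + '</head>', 1)


if __name__ == '__main__':
    sample = ('<html><head><title>x</title></head><body>menu'
              + START + '<tr><td>News text</td></tr></table>'
              + END + 'footer links</body></html>')
    print(clean_page(sample))
```

Returning the page unchanged when a marker is missing is a design choice of this sketch, so that a page with a different layout is not silently emptied.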
The bbcnews.rep file is only for this example. You can download the practical version from our website, which does not contain my signature and uses an external style sheet for multiple pages.
As you will see, the pages of most websites have a similarly regular structure. If you know HTML, you can find the pattern, pick out the keywords, and compose your own rep file for a specific website.
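When picking keywords for a new website, it helps to confirm that a candidate marker really appears once, and only once, in every saved page. The short Python sketch below shows one way to check this. The function name marker_is_usable is invented for this example, and the check is a suggestion, not a feature of Easy Replace.

```python
# Illustrative sketch only: test whether a candidate boundary keyword
# occurs exactly once in each saved page. The name is hypothetical.

def marker_is_usable(marker: str, pages: list[str]) -> bool:
    """Return True if the marker occurs exactly once in every page."""
    return all(page.count(marker) == 1 for page in pages)


if __name__ == '__main__':
    # Two tiny stand-ins for the HTML of saved pages.
    saved_pages = [
        '<div>menu</div><!-- E BO --><div>footer</div>',
        '<div>other menu</div><!-- E BO --><div>footer</div>',
    ]
    print(marker_is_usable('<!-- E BO -->', saved_pages))
    print(marker_is_usable('<div>', saved_pages))
```

A marker that appears several times in a page, like the div tag in the sample, is a poor boundary because it is unclear which occurrence a rep command would match.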
If you are not familiar with HTML, don't worry. We will add more rep files for more websites on our website, and we welcome rep files composed by users. You can look for and download them there.