Parrot—Similar Articles Search System - Sword Sharpening Hut (thw's BLOG)

by thw on 2007-07-04 18:27:21

On the Cool Stuff Blog, I found Parrot, a tool developed by mr-wednesday that is currently in the experimental stage. Its purpose is to find articles on the internet with similar content, which can be used for plagiarism detection and other applications.

After opening the homepage, you will see two input fields. The first is for the URL of the webpage where the article is located. The second, which is mandatory, is for the article content you want to check for similarity; you can enter a single paragraph or the entire article. Then press the "submit" button. The computation takes about 30 seconds. The system then lists similar web pages in descending order of similarity score, showing each page's score next to its link. The score is a positive number, and the higher it is, the more similar the content. According to the developer's experiments, webpages scoring over 20 points are considered highly similar.
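To make the result list concrete, here is a minimal Python sketch of how such output could be post-processed: sort the matches by score in descending order and keep the ones above the 20-point mark mentioned above. This is not Parrot's own code, and it says nothing about how Parrot computes its scores. The `Match` type, the function names, and the sample URLs and scores are all made up for illustration.

```python
from typing import List, NamedTuple

# From the post: pages scoring over 20 points are considered highly similar.
HIGH_SIMILARITY_THRESHOLD = 20.0


class Match(NamedTuple):
    """One result row: a page URL and its similarity score (hypothetical type)."""
    url: str
    score: float


def rank_matches(matches: List[Match]) -> List[Match]:
    """Return the matches sorted by score, highest first."""
    return sorted(matches, key=lambda m: m.score, reverse=True)


def highly_similar(matches: List[Match],
                   threshold: float = HIGH_SIMILARITY_THRESHOLD) -> List[Match]:
    """Return the ranked matches whose score is strictly above the threshold."""
    return [m for m in rank_matches(matches) if m.score > threshold]


if __name__ == "__main__":
    # Invented sample data, only to show the shape of the result list.
    sample = [
        Match("http://example.com/a", 5.0),
        Match("http://example.com/b", 31.5),
        Match("http://example.com/c", 20.0),
    ]
    for m in rank_matches(sample):
        flag = "highly similar" if m.score > HIGH_SIMILARITY_THRESHOLD else ""
        print(f"{m.score:6.1f}  {m.url}  {flag}")
```

A page scoring exactly 20 is not flagged here, since the post says "over 20"; adjust the comparison if you read the threshold differently.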

PS: Some plagiarists are lazy yet still want the credit, so they strip the original source information to make it look as if they were the first to discover something. This usually backfires, because Google Blog Search makes it quite clear who posted first and who posted later. In fact, keeping the original link not only shows humility but also benefits your own blog. A blogger who is mentioned by another usually feels grateful and wants to reciprocate, which leads to lively exchanges among bloggers. Moreover, many veteran bloggers have developed a culture in which, even if they don't repost each other's content, they are happy to mention each other in their posts. As a result, their blogs also tend to rank very high overall.

Reposted from Mo Jian Lu (thw’s blog): http://www.thws.cn