Methods for detecting plagiarized and pseudo-original content

If you are an experienced Google SEO operator, you probably outsource your website content to foreign writers. But how do you tell whether a writer really put in the effort? And how do you tell whether the content they hand in is plagiarized, copied, or pseudo-original (spun from someone else's article)?

I believe this problem gives many Google SEO practitioners in China a headache. The same goes for Teacher Yiquan: my English is poor, and I cannot tell the difference just by reading! (I coasted through university, skipped English classes, and passed the level-four English exam (CET-4) only by luck. I scored about 140 in English on the college entrance examination, but I gave it all back during university …)

What I lack in ability, I make up for with ingenuity. The company has foreign writers produce a dozen articles every day, at least 10,000 words in total, so plagiarism and pseudo-original content must be checked. After reading a lot of material and running practical tests, Teacher Yiquan settled on the following effective methods.

Method 1: Paste the text into Google Translate and check whether the Chinese translation is fluent and well organized.

Pseudo-original content is usually produced by replacing words in the original with synonyms and near-synonyms. After the replacement, neighboring sentences often no longer fit together, and once the text is translated into another language it becomes completely unreadable, so translation works as a detection method. (Google Translate is very capable now: the more natural and fluent the writer's article is, the more readable the translation will be.)

For example, a foreign writer submitted a draft today. We pasted one paragraph into Google Translate, as shown below, and the Chinese translation is very smooth overall:

[Screenshot: Google Translate result for the submitted paragraph]

If the sentences read smoothly throughout and the logic of the whole article is clear, the article basically passes this check. Of course, we still need to determine whether it is plagiarized, which is what the other methods below are for.

(Some industries are specialized: even an article written by hand by a foreign writer may not be readable after translation, so it is hard to judge with Google Translate.)

Method 2: Use dedicated tools for detecting plagiarism and pseudo-original content (four of them)

Teacher Yiquan has found four tools with this function. The first is duplichecker, which was covered in an earlier article; those who are interested can look back at it.

The second one is spinmenot (URL: https://www.spinmenot.io)

This is the only tool I have found on Google that explicitly states that its function is to detect pseudo-original content. (The demand for originality checking comes mainly from non-native English speakers, so very little software of this kind turns up on Google.)

According to the tool's own description, it works by having an AI learn the characteristics of pseudo-original articles and then judge the text we submit against them.

Spinmenot reports three bands: below 40%, pseudo-original content is unlikely; 40% to 60%, it is more likely; 60% or more, it is very likely.
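The three bands can be written down as a small helper for sorting a batch of articles. This is only a sketch: the function name and the labels are made up for illustration, it assumes the first band means "below 40%", and it says nothing about how Spinmenot computes the percentage itself.

```python
def spin_risk(score_percent: float) -> str:
    """Map a Spinmenot-style percentage to one of the three bands in the text.

    Assumption: the lowest band is "below 40%"; the band labels are hypothetical.
    """
    if not 0 <= score_percent <= 100:
        raise ValueError("score must be between 0 and 100")
    if score_percent < 40:
        return "low"        # pseudo-original unlikely
    if score_percent < 60:
        return "elevated"   # pseudo-original more likely
    return "high"           # pseudo-original very likely


if __name__ == "__main__":
    for score in (12, 47, 83):
        print(score, spin_risk(score))
```

Anything that lands in the middle or top band would then go on to the plagiarism tools described next.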

For the same article delivered today, the Spinmenot result is shown below:

[Screenshot: Spinmenot detection result]

This tool seems pretty good, and it is completely free. However, it gives no clear explanation of how it reaches its verdict, so the result is not fully convincing and can only serve as a preliminary reference.

The third tool: copyscape paid version (address: www.copyscape.com)

Copyscape is probably the best-known plagiarism detection tool you will find on Google. It started out checking student papers, later discovered a bigger business opportunity, and grew into the current Copyscape, which is basically a content search engine.

The free version of Copyscape only accepts a URL as input, and our volume is large, so I had to buy the paid version (a 1,900-word check costs about 0.2 USD).
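To get a feel for the budget, the quoted rate can be applied to the daily volume mentioned earlier (at least 10,000 words). This is rough arithmetic only: it assumes the cost scales linearly with word count, which is an illustration and not Copyscape's actual billing rule, and the constant and function names are made up.

```python
USD_PER_CHECK = 0.20     # price quoted in the text
WORDS_PER_CHECK = 1900   # words covered by that price in the text


def estimate_cost_usd(total_words: int) -> float:
    """Linear estimate of checking cost (assumption: cost scales with words)."""
    if total_words < 0:
        raise ValueError("total_words must not be negative")
    return total_words / WORDS_PER_CHECK * USD_PER_CHECK


if __name__ == "__main__":
    daily = estimate_cost_usd(10_000)
    print(f"about {daily:.2f} USD per day")  # about 1.05 USD per day
```

At roughly one dollar a day for 10,000 words, the paid version is cheap compared with the cost of the writing itself.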

For the same article again, the paid-version result is as follows:

[Screenshot: Copyscape paid-version result]

As the figure above shows, Copyscape found no problems. If there were no fourth tool, I would have been very happy with this result.

The fourth tool: quetext (address: www.quetext.com)

This is a very good tool. Its main function is to check whether your content is plagiarized. The free version can only check 500 words at a time. The paid version is also very cheap at 9 dollars a month, so I bought it without hesitation.

Here are the test results (for the same article used above):

[Screenshot: Quetext result]

The result above still comes from the free version, on fewer than 500 words. The check reports 7% repetition. Click the repeated content marked with a yellow underline to see a more detailed explanation (which source it duplicates, and how closely).
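Because the free version only takes 500 words at a time, a longer draft has to be pasted in pieces. A minimal sketch of splitting a draft on whitespace is shown here; the function name is made up, and the way it counts words may not match Quetext's own word counter exactly.

```python
from typing import List


def split_by_words(text: str, max_words: int = 500) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    words = text.split()  # whitespace-based word count (an approximation)
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


if __name__ == "__main__":
    draft = "word " * 1200
    for number, chunk in enumerate(split_by_words(draft), start=1):
        print(number, len(chunk.split()))
```

Splitting on whitespace loses paragraph breaks, which is fine for a plagiarism check but not for publishing the text afterwards.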

Quetext caught what Copyscape could not detect. Judged overall, Quetext is slightly better. (Of course, the reason may be that Copyscape is too famous: many people check with it, and writers likewise revise their text until it passes Copyscape, so Copyscape can no longer catch much.)

Are there other ways to detect pseudo-original content besides the four tools above? Of course there are, but they take a little more effort. For example, you can use the intext: search operator (look it up on Google if you are curious, it is very simple) to search Google for passages from the content the writer handed in, or analyze the punctuation marks to judge whether the article is pseudo-original. Recommended reading: “How does Google judge original articles and pseudo-original”.
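The intext: approach can be partly automated by turning a few sentences of the draft into exact-phrase queries. The sketch below only builds the query strings and search URLs for opening in a browser by hand; it does not fetch results. The function names, the sentence-splitting rule, and the word limits are all assumptions chosen for illustration.

```python
import re
from typing import List
from urllib.parse import quote_plus


def intext_queries(text: str, min_words: int = 8,
                   max_words: int = 12, limit: int = 5) -> List[str]:
    """Build exact-phrase intext: queries from the longer sentences of a draft."""
    # Naive sentence split: break after ., ! or ? followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    queries = []
    for sentence in sentences:
        words = sentence.split()
        if len(words) < min_words:
            continue  # short sentences match too many unrelated pages
        phrase = " ".join(words[:max_words])
        phrase = phrase.replace('"', "").rstrip(".,;:!?")
        queries.append(f'intext:"{phrase}"')
        if len(queries) == limit:
            break
    return queries


def google_search_url(query: str) -> str:
    """Return a Google search URL for the query, to be opened manually."""
    return "https://www.google.com/search?q=" + quote_plus(query)


if __name__ == "__main__":
    sample = ("Short one. This sentence has clearly more than eight "
              "words in it today. Tiny.")
    for q in intext_queries(sample):
        print(q)
        print(google_search_url(q))
```

If an exact phrase from the draft returns a page that the writer did not author, that paragraph deserves a closer look. Pseudo-original text with swapped synonyms will often slip past exact-phrase search, which is why this complements the tools above and does not replace them.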