From Surf Wiki (app.surf) — the open knowledge base
Spam mass
Website ranking measure
Website ranking measure
Spam mass is defined as "the measure of the impact of link spamming on a page's ranking." The concept was developed by Zoltán Gyöngyi and Hector Garcia-Molina of Stanford University in association with Pavel Berkhin and Jan Pedersen of Yahoo!. This paper expands upon their proposed TrustRank methodology.
The researchers developed a good core and a bad core of selected Web documents, from which they measured spam mass across a collection of documents. Two types of measurements, absolute mass and relative mass, are used to compare groups of documents. The higher the mass measurements, the more likely the documents are to be equivalent to spam.
Thresholds
A threshold value is used to identify groups of documents as spam. If their relative mass value exceeds the threshold, the documents are considered to be spam. A second threshold for the PageRank values of the selected documents is applied. Only high PageRank documents are labelled as spam.
The purpose of the methodology is to identify spam documents with artificially inflated PageRank values.
References
This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.
Ask Mako anything about Spam mass — get instant answers, deeper analysis, and related topics.
Research with MakoFree with your Surf account
Create a free account to save articles, ask Mako questions, and organize your research.
Sign up freeThis content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.
Report