Webgraph

Graph of connected web pages


Summary

The webgraph describes the directed links between pages of the World Wide Web. In general, a graph consists of a set of vertices, some pairs of which are connected by edges. In a directed graph, each edge has a direction and is also called an arc. The webgraph is a directed graph whose vertices correspond to the pages of the WWW, with a directed edge from page X to page Y if page X contains a hyperlink to page Y.
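
As a minimal sketch of this definition, the Python snippet below stores a tiny webgraph as adjacency sets. The page names and links are hypothetical and chosen only for illustration; `build_webgraph` is an illustrative helper, not part of any library.

```python
from collections import defaultdict

# Hypothetical pages and hyperlinks (source page, target page).
hyperlinks = [
    ("a.html", "b.html"),
    ("a.html", "c.html"),
    ("b.html", "c.html"),
    ("c.html", "a.html"),
]

def build_webgraph(links):
    """Return (out_links, in_links) adjacency sets for (source, target) pairs."""
    out_links = defaultdict(set)
    in_links = defaultdict(set)
    for source, target in links:
        out_links[source].add(target)  # directed edge source -> target
        in_links[target].add(source)
        # Accessing a defaultdict key creates it, so pages without
        # out-links (or in-links) still appear as vertices in both maps.
        out_links[target]
        in_links[source]
    return dict(out_links), dict(in_links)

out_links, in_links = build_webgraph(hyperlinks)
out_degree = {page: len(targets) for page, targets in out_links.items()}
in_degree = {page: len(sources) for page, sources in in_links.items()}
```

Because the edges are directed, `a.html` linking to `b.html` does not imply the reverse, which is why in-degree and out-degree are tracked separately.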

Properties

  • The degree distribution of the webgraph differs strongly from that of the classical random graph model, the Erdős–Rényi model: the Erdős–Rényi model produces far fewer high-degree nodes than are observed in the webgraph. The precise form of the webgraph's distribution is unclear, however: it is described relatively well both by a lognormal distribution and by the power law produced by the Barabási–Albert model.
  • The webgraph is an example of a scale-free network.
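
The contrast between the two models can be illustrated with a small simulation. The sketch below is a simplification under stated assumptions: both graphs are undirected, the sizes and parameters (`n`, `m`, the seed) are arbitrary, and the preferential-attachment routine is a basic Barabási–Albert-style growth process rather than a model fitted to real web data.

```python
import random

def erdos_renyi_degrees(n, p, rng):
    """Degrees in an undirected G(n, p) graph: each pair is linked
    independently with probability p."""
    degree = [0] * n
    for u in range(n):
        for v in range(u + 1, n):
            if rng.random() < p:
                degree[u] += 1
                degree[v] += 1
    return degree

def preferential_attachment_degrees(n, m, rng):
    """Degrees in a graph grown Barabási–Albert style: each new node links
    to m distinct existing nodes, chosen proportionally to their degree."""
    # Start from a complete graph on m + 1 nodes (every node has degree m).
    degree = [m] * (m + 1) + [0] * (n - m - 1)
    # Each node appears once per incident edge, so a uniform draw from
    # this list is a degree-proportional draw.
    endpoints = [u for u in range(m + 1) for _ in range(m)]
    for new in range(m + 1, n):
        targets = set()
        while len(targets) < m:
            targets.add(rng.choice(endpoints))
        for t in targets:
            degree[t] += 1
            degree[new] += 1
            endpoints.extend((t, new))
    return degree

rng = random.Random(0)   # fixed seed, so repeated runs give the same graphs
n, m = 1000, 2           # assumed sizes, for illustration only
# p is chosen so both graphs have a mean degree of roughly 2 * m.
er_degrees = erdos_renyi_degrees(n, 2 * m / (n - 1), rng)
ba_degrees = preferential_attachment_degrees(n, m, rng)

print("max degree, Erdős–Rényi:", max(er_degrees))
print("max degree, preferential attachment:", max(ba_degrees))
```

With matching mean degrees, the preferential-attachment graph typically contains a handful of hubs whose degree far exceeds anything in the Erdős–Rényi graph, which is the qualitative difference the first property describes.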

Applications

The webgraph is used for:

  • computing the PageRank of the World Wide Web's pages;
  • computing personalized PageRank;
  • detecting webpages on similar topics through graph-theoretical properties alone, such as co-citation;
  • identifying hubs and authorities on the web for the HITS algorithm.
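
The applications listed above can be sketched on a toy webgraph. The following Python code is a minimal illustration, not a production implementation: the four-page graph is hypothetical, the damping factor of 0.85 and the iteration counts are assumed defaults, and every link target is assumed to also appear as a key of `out_links`.

```python
from collections import Counter
from itertools import combinations

def pagerank(out_links, damping=0.85, iterations=50):
    """Power-iteration PageRank on a dict: page -> set of pages it links to."""
    pages = list(out_links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not out_links[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        for p in pages:
            for target in out_links[p]:
                new_rank[target] += damping * rank[p] / len(out_links[p])
        rank = new_rank
    return rank

def cocitation_counts(out_links):
    """For each unordered pair of pages, count the pages that link to both."""
    counts = Counter()
    for targets in out_links.values():
        for pair in combinations(sorted(targets), 2):
            counts[pair] += 1
    return counts

def hits(out_links, iterations=50):
    """Hub and authority scores (HITS), L2-normalised after every round."""
    pages = list(out_links)
    hub = {p: 1.0 for p in pages}
    for _ in range(iterations):
        # Authority: sum of the hub scores of the pages linking to a page.
        auth = {p: 0.0 for p in pages}
        for p in pages:
            for target in out_links[p]:
                auth[target] += hub[p]
        # Hub: sum of the authority scores of the pages a page links to.
        hub = {p: sum(auth[t] for t in out_links[p]) for p in pages}
        for scores in (hub, auth):
            norm = sum(v * v for v in scores.values()) ** 0.5 or 1.0
            for p in scores:
                scores[p] /= norm
    return hub, auth

# Hypothetical four-page web used only for illustration.
web = {
    "home": {"news", "docs"},
    "blog": {"news", "docs"},
    "news": {"docs"},
    "docs": {"home"},
}

rank = pagerank(web)
cocited = cocitation_counts(web)
hub, auth = hits(web)
```

In this toy graph, `docs` receives links from three of the four pages and ends up with the highest PageRank and authority score, while `news` and `docs` are co-cited by both `home` and `blog`.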

References

  1. (2008). "Introduction to Information Retrieval". Cambridge University Press.
  2. Erdős, Paul. (1960). "On the evolution of random graphs". Publication of the Mathematical Institute of the Hungarian Academy of Sciences.
  3. (2015). "The Graph Structure in the Web - Analyzed on Different Aggregation Levels". Journal of Web Science.
  4. (October 1999). "Emergence of scaling in random networks". Science.
  5. Brin, Sergey. (1998-04-01). "The anatomy of a large-scale hypertextual Web search engine". Computer Networks and ISDN Systems.
  6. Jeh, Glen; Widom, Jennifer. (2003). "Scaling personalized web search". Proceedings of the 12th International Conference on World Wide Web (WWW '03). ACM, New York, NY, USA, 271–279. doi:10.1145/775152.775191
  7. (1999). "Trawling the Web for emerging cyber-communities". Computer Networks.
Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.
