Surface web and deep web: Difference between revisions

From Computer Science Wiki
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[file:Connection.png|right|frame|Web Science<ref>http://www.flaticon.com/</ref>]]
[[file:Connection.png|right|frame|Web Science<ref>http://www.flaticon.com/</ref>]]


Internet content that is not capable of being searched by a web search engine is generally described as the deep web.<ref>https://en.wikipedia.org/wiki/Web_search_engine</ref>
== Distinguishing Between the Surface Web and the Deep Web ==
 
The World Wide Web can be divided into two primary layers based on accessibility and visibility: the '''surface web''' and the '''deep web'''. These terms highlight the contrast between publicly accessible content and information that remains hidden or restricted.
 
=== Surface Web ===
The '''surface web''' refers to the portion of the World Wide Web that is indexed by standard search engines like Google, Bing, and Yahoo. It is accessible to anyone with an internet connection and does not require special credentials or permissions to view.
 
Examples of surface web content include:
* Publicly accessible websites
* Blogs and articles freely available to the public
* Informational pages, such as Wikipedia
* Most online stores and their publicly visible product listings
 
Search engines can easily index and retrieve content from the surface web, making it the most familiar part of the internet for everyday users.
 
=== Deep Web ===
The '''deep web''', also referred to as the '''invisible web''' or '''hidden web''', consists of parts of the World Wide Web whose contents are not indexed by standard search engines. This content is hidden for various reasons, such as security, privacy, or the need for user authentication.
 
Key characteristics of the deep web:
* Content is not indexed by search engines and cannot be retrieved through standard search queries.
* It often resides behind HTTP forms, requiring user interaction to access.
 
==== Examples of Deep Web Content ====
 
The deep web includes a wide variety of online resources and services, such as:
* Webmail services: Email accounts provided by platforms like Gmail, Outlook, and Yahoo Mail.
* Online banking: User-specific account details and transactions protected by login credentials.
* Private social media pages and profiles: Content shared only with approved users or within restricted groups.
* Restricted web forums: Forums that require registration to view or post content.
* Paywalled services: Content behind payment barriers, such as:
** Subscription-based news outlets and magazines
** Video-on-demand platforms like Netflix or Hulu
* Academic and research databases: Institutional repositories or subscription-based services like JSTOR and PubMed.
 
==== Accessing the Deep Web ====
While the deep web is not publicly indexed, its content can still be accessed in specific ways:
* By using a direct URL (Uniform Resource Locator) or IP (
 


== Standards ==
== Standards ==

Latest revision as of 12:43, 14 January 2025

Web Science[1]

Distinguishing Between the Surface Web and the Deep Web[edit]

The World Wide Web can be divided into two primary layers based on accessibility and visibility: the surface web and the deep web. These terms highlight the contrast between publicly accessible content and information that remains hidden or restricted.

Surface Web[edit]

The surface web refers to the portion of the World Wide Web that is indexed by standard search engines like Google, Bing, and Yahoo. It is accessible to anyone with an internet connection and does not require special credentials or permissions to view.

Examples of surface web content include:

  • Publicly accessible websites
  • Blogs and articles freely available to the public
  • Informational pages, such as Wikipedia
  • Most online stores and their publicly visible product listings

Search engines can easily index and retrieve content from the surface web, making it the most familiar part of the internet for everyday users.

Deep Web[edit]

The deep web, also referred to as the invisible web or hidden web, consists of parts of the World Wide Web whose contents are not indexed by standard search engines. This content is hidden for various reasons, such as security, privacy, or the need for user authentication.

Key characteristics of the deep web:

  • Content is not indexed by search engines and cannot be retrieved through standard search queries.
  • It often resides behind HTTP forms, requiring user interaction to access.

Examples of Deep Web Content[edit]

The deep web includes a wide variety of online resources and services, such as:

  • Webmail services: Email accounts provided by platforms like Gmail, Outlook, and Yahoo Mail.
  • Online banking: User-specific account details and transactions protected by login credentials.
  • Private social media pages and profiles: Content shared only with approved users or within restricted groups.
  • Restricted web forums: Forums that require registration to view or post content.
  • Paywalled services: Content behind payment barriers, such as:
    • Subscription-based news outlets and magazines
    • Video-on-demand platforms like Netflix or Hulu
  • Academic and research databases: Institutional repositories or subscription-based services like JSTOR and PubMed.

Accessing the Deep Web[edit]

While the deep web is not publicly indexed, its content can still be accessed in specific ways:

  • By using a direct URL (Uniform Resource Locator) or IP (


Standards[edit]

These standards are used from the IB Computer Science Subject Guide[2]

  • Distinguish between the surface web and the deep web.

References[edit]

  1. http://www.flaticon.com/
  2. IB Diploma Programme Computer science guide (first examinations 2014). Cardiff, Wales, United Kingdom: International Baccalaureate Organization. January 2012.