Surface web and deep web: Difference between revisions

From Computer Science Wiki
No edit summary
No edit summary
Line 1: Line 1:
[[file:Connection.png|right|frame|Web Science<ref>http://www.flaticon.com/</ref>]]
[[file:Connection.png|right|frame|Web Science<ref>http://www.flaticon.com/</ref>]]


The deep web, invisible web, or hidden web are parts of the World Wide Web whose contents are not indexed by standard web search-engines. The opposite term to the deep web is the "surface web", which is accessible to anyone/everyone using the Internet. 
== Distinguishing Between the Surface Web and the Deep Web ==


The content of the deep web is hidden behind HTTP forms and includes many very common uses such as web mail, online banking, private or otherwise restricted access social-media pages and profiles, some web forums that require registration for viewing content, and services that users must pay for, and which are protected by paywalls, such as video on demand and some online magazines and newspapers.
The World Wide Web can be divided into two primary layers based on accessibility and visibility: the '''surface web''' and the '''deep web'''. These terms highlight the contrast between publicly accessible content and information that remains hidden or restricted.
 
=== Surface Web ===
The '''surface web''' refers to the portion of the World Wide Web that is indexed by standard search engines like Google, Bing, and Yahoo. It is accessible to anyone with an internet connection and does not require special credentials or permissions to view.
 
Examples of surface web content include:
* Publicly accessible websites
* Blogs and articles freely available to the public
* Informational pages, such as Wikipedia
* Most online stores and their publicly visible product listings
 
Search engines can easily index and retrieve content from the surface web, making it the most familiar part of the internet for everyday users.
 
=== Deep Web ===
The '''deep web''', also referred to as the '''invisible web''' or '''hidden web''', consists of parts of the World Wide Web whose contents are not indexed by standard search engines. This content is hidden for various reasons, such as security, privacy, or the need for user authentication.
 
Key characteristics of the deep web:
* Content is not indexed by search engines and cannot be retrieved through standard search queries.
* It often resides behind HTTP forms, requiring user interaction to access.
 
==== Examples of Deep Web Content ====
The deep web includes a wide variety of online resources and services, such as:
* Webmail services: Email accounts provided by platforms like Gmail, Outlook, and Yahoo Mail.
* Online banking: User-specific account details and transactions protected by login credentials.
* Private social media pages and profiles: Content shared only with approved users or within restricted groups.
* Restricted web forums: Forums that require registration to view or post content.
* Paywalled services: Content behind payment barriers, such as:
  * Subscription-based news outlets and magazines
  * Video-on-demand platforms like Netflix or Hulu
* Academic and research databases: Institutional repositories or subscription-based services like JSTOR and PubMed.
 
==== Accessing the Deep Web ====
While the deep web is not publicly indexed, its content can still be accessed in specific ways:
* By using a direct URL (Uniform Resource Locator) or IP (


The content of the deep web can be located and accessed by a direct URL or IP address, but may require a password or other security access to get past public-website pages.<ref>https://en.wikipedia.org/wiki/Deep_web</ref>


== Standards ==
== Standards ==

Revision as of 12:42, 14 January 2025

Web Science[1]

Distinguishing Between the Surface Web and the Deep Web[edit]

The World Wide Web can be divided into two primary layers based on accessibility and visibility: the surface web and the deep web. These terms highlight the contrast between publicly accessible content and information that remains hidden or restricted.

Surface Web[edit]

The surface web refers to the portion of the World Wide Web that is indexed by standard search engines like Google, Bing, and Yahoo. It is accessible to anyone with an internet connection and does not require special credentials or permissions to view.

Examples of surface web content include:

  • Publicly accessible websites
  • Blogs and articles freely available to the public
  • Informational pages, such as Wikipedia
  • Most online stores and their publicly visible product listings

Search engines can easily index and retrieve content from the surface web, making it the most familiar part of the internet for everyday users.

Deep Web[edit]

The deep web, also referred to as the invisible web or hidden web, consists of parts of the World Wide Web whose contents are not indexed by standard search engines. This content is hidden for various reasons, such as security, privacy, or the need for user authentication.

Key characteristics of the deep web:

  • Content is not indexed by search engines and cannot be retrieved through standard search queries.
  • It often resides behind HTTP forms, requiring user interaction to access.

Examples of Deep Web Content[edit]

The deep web includes a wide variety of online resources and services, such as:

  • Webmail services: Email accounts provided by platforms like Gmail, Outlook, and Yahoo Mail.
  • Online banking: User-specific account details and transactions protected by login credentials.
  • Private social media pages and profiles: Content shared only with approved users or within restricted groups.
  • Restricted web forums: Forums that require registration to view or post content.
  • Paywalled services: Content behind payment barriers, such as:
 * Subscription-based news outlets and magazines
 * Video-on-demand platforms like Netflix or Hulu
  • Academic and research databases: Institutional repositories or subscription-based services like JSTOR and PubMed.

Accessing the Deep Web[edit]

While the deep web is not publicly indexed, its content can still be accessed in specific ways:

  • By using a direct URL (Uniform Resource Locator) or IP (


Standards[edit]

These standards are used from the IB Computer Science Subject Guide[2]

  • Distinguish between the surface web and the deep web.

References[edit]

  1. http://www.flaticon.com/
  2. IB Diploma Programme Computer science guide (first examinations 2014). Cardiff, Wales, United Kingdom: International Baccalaureate Organization. January 2012.