🌐

Overview of Web Science Concepts

Jan 8, 2025

CS Option C: Web Science Lecture Notes

Introduction

  • Exhaustive information dump with a focus on memorization.
  • Main difficulty: PHP, HTML, MySQL.
  • Overlaps with Topic 3 (Networks).
  • HL content is random and weird.
  • A study guide will be released shortly.

The Internet vs. The World Wide Web

  • Internet: Global network infrastructure (hardware).
  • World Wide Web: Collection of web pages stored on internet infrastructure.
  • Internet involves ISPs, routers, switches, etc.
  • Uses protocols like TCP/IP, DNS, Email, FTP.
  • WWW uses HTTP.

Websites and Web Pages

  • Website: Collection of related web pages.
  • Web Page: Individual page within a website.
  • Accessed via web browsers using URLs.

HTTP and TCP/IP

  • HTTP: Protocol for transferring web pages.
  • TCP/IP: Fundamental protocol for internet communication.

HTTP Requests and Responses

  • Request made from client to server; response includes HTML/CSS/JavaScript.
  • Process includes DNS resolution, establishing TCP connection, and HTTP request/response cycle.

Secure HTTP (HTTPS)

  • Encrypted version of HTTP using SSL/TLS.
  • Uses digital certificates to verify website security.

Ports

  • Numeric identifiers for network services (e.g., HTTP - 80, HTTPS - 443).

Web Browsers

  • Software to access/display content on the WWW.
  • Examples: Google Chrome, Safari, Mozilla Firefox.

Cookies

  • Small text files stored in web browsers to remember user preferences and login info.

Web Application Development

HTML

  • Markup language for creating web pages.
  • Structure: head, body, footer, etc.

PHP and MySQL

  • Server-side scripting language for dynamic content generation.
  • MySQL used for database manipulation.
  • Integration: PHP processes HTTP request, interacts with MySQL, returns HTML/CSS/JavaScript.

CSS

  • Styles and formats HTML documents.
  • Can be stored in separate files and shared among web pages.

JavaScript

  • High-level programming language for client-side scripting.
  • Adds interactivity to web pages.

Static vs. Dynamic Web Pages

  • Static: Unchanging content (e.g., brochure websites).
  • Dynamic: Interactive, frequently changing content (e.g., social media sites).

XML and XHTML

  • XML: Encodes data in structured format.
  • XHTML: Combines XML and HTML features.

Search Engines

  • Software to search for information on the WWW using algorithms.
  • Process: Crawling, Indexing, Ranking, Results.

Search Engine Algorithms

  • PageRank: Evaluates quality of web pages based on in-links.
  • HITS: Evaluates based on authority and hub scores.

Search Engine Optimization (SEO)

  • Techniques to improve website ranking (e.g., keyword optimization, mobile optimization).
  • White Hat: Ethical SEO practices.
  • Black Hat: Unethical SEO practices.

Web Paradigms

  • Web 1.0: Static web pages.
  • Web 2.0: User-generated content, social media.
  • Web 3.0: AI, machine learning, blockchain.

Computing Paradigms

  • Grid Computing: Distributed computing across multiple networks.
  • Mobile Computing: Portable devices (e.g., smartphones).
  • Ubiquitous Computing: Integration of computing in everyday environments.
  • P2P Computing: Decentralized network of computers.
  • Cloud Computing: Remote servers for data storage and processing.

Interoperability and Standards

  • Interoperability: Ability of systems to work together.
  • Standards: Guidelines for interoperability (e.g., TCP/IP, HTML).
  • Open Standards: Publicly available standards.

Creative Commons License

  • Allows creators to share work while retaining some rights.

HL Topics

Graph Theory

  • Use of directed graphs to represent the web.
  • BOW-TIE Model: Structure of the web (core, in-components, out-components, tendrils, tubes).

Ambient and Collective Intelligence

  • Ambient Intelligence: Technology integration in environments.
  • Collective Intelligence: Collaborative problem-solving by groups.

Power Law Distribution

  • Statistical distribution relevant to web connections (few nodes with many links).

This concludes the lecture notes for CS Option C: Web Science.