Guide to High School Computer Science
  • 💻Introduction
    • windows & Python Development
    • macOS & Python Development
    • Visual Studio Code Settings
    • Set up Github
    • Author Page
  • 🧠Prerequisite Skills
    • Keyboard Typing
    • Files & Directories
    • Use of Command Line
    • Git & GitHub
    • Markdown
    • Starting Your Python Project
  • 🐍Python Programming
    • 🍎Python Basics
      • What is Python?
      • Procedural Programming & Programming Paradigms
      • String Formatting
      • Data Types
      • Input & Output to Console
      • Working with Numbers
      • Useful Built-in Functions
      • Math & Random Module
      • Boolean Data Object
      • Comparison, Logical, and Membership Operators
      • If Statements
      • Binary Decisions
      • Multiple Decisions
      • Nested Conditions
      • [EXTRA] Bitwise Operators
      • [EXTRA] Python Style Guide
    • ⏮️Iterations
      • Introduction to While Loops
      • Infinite Loop
      • Controlling Your While Loops
      • Introduction to For Loops
      • For Loops w/ Numeric Sequences
      • For Loops w/ Strings & Lists
      • Iterable Functions w/ For Loops
    • 📦Collections
      • Strings
        • String Basics
        • String Indexing
        • String Slicing
        • String Operators & Functions
        • Basic String Methods
        • String Methods Extended
        • String Methods Document
      • Tuples & Lists
        • Tuples
        • List Basics
        • List are Mutable
        • Adding Items to a List
        • Removing Items from a List
        • Search & Reverse a List
        • List Comprehension
        • List Methods Document
      • Sets
      • Dictionary
      • How to Store Multiple Data Items
    • 💡Defining Functions
      • Functions
      • print() vs return
      • Pre-determined Arguments
      • Nested Functions
      • Map & Filter
      • [Extra] Dynamic Arguments
    • 💾File I/O
      • How to Save Text to an External File
      • Reading CSV in Python
      • Reading JSON in Python
    • 🔨Basic Python Projects
      • Basic Calculator
        • Improving the calculator
        • Exercise Set 1
        • Exercise Set 2
        • 💎Streamlit Application #1
      • Basic Password Generator
        • Exercise Set 3
        • Exercises Related to Math
        • 💎Streamlit Application #2
      • A To-Do Task List
    • ⏳Introduction to Algorithmic Thinking
      • Big-O Notation
      • Basic Algorithms
        • Linear Search
        • Binary Search
        • Basic Sorting Algorithms
      • Recursion
      • Brute Force Algorithms
      • Greedy Algorithm
        • Time on Task (CCC 2013 J4)
        • Dijkstra’s Algorithm
      • Divide and Conquer
        • Merge Sort
      • Dynamic Programming
    • 🤯Object Oriented Programming
      • Class & Objects (Definitions)
      • OOP in Python
      • Encapsulation
      • Polymorphism
      • Inheritance & Overriding
      • Override Magic Methods
      • Case Study: 2D Vectors
      • Case Study: Deck of Cards
      • Exercise
      • Abstract Data Types
      • Case Study: Static 1D Array From Java
    • Competitive Programming
      • Is This Sum Possible?
        • Is the dataset sorted?
        • Searching for a value
        • Determine if the difference between an integer from the array and the target value exists
        • Sorting Algorithms
        • Using Two Pointers
      • Two Sum - LeetCode
        • Generate all possible pairs of values
        • Subtract each value from the target, see if the difference exists in the list
      • Longest Common Prefix - LeetCode
        • Compare all possible prefixes
        • Create the longest common prefix with the direct neighbour
      • Length of Last Word - LeetCode
        • Compare all possible prefixes
      • Where can I go from one point to another?
      • Sample Outline
    • IB Recipe Book
  • 💾Python & Databases
    • Intro to Databases & Data Modeling
      • Common Data Types in SQL
      • Introduction to ERDs
      • Primary Keys and Foreign Keys
      • Database Normalization
    • What is SQL?
      • Getting Started
      • SELECT Queries
        • Selection with Conditions
        • Selection with Fuzziness
        • Selection and Sorting in Order
        • Selection without Duplicates
        • Selection with Limited Number of Outputs
      • AGGREGATE Queries
        • Counting Rows
        • Sum, Average, Min/Max Queries
        • Working with Aggregate Queries
        • Power of using Groups
        • Exercise
      • Interacting with Multiple Table
      • Inserting Data
      • External Resource
  • ☕Java Essentials
    • Basics
      • Starting Java
      • Data & Variables
      • Handling User Inputs & Type Conversion
      • Arithmetic
      • IPO Model
      • Basic Built-in Methods
      • Exercise Questions
    • Conditionals
      • Boolean Operators
      • Compare Strings
      • If Statements
      • If Else Statements
      • Making Multiple Decisions
      • Using Switch
      • Flowchart Symbols
      • Exercise Questions
    • Iterations
      • While Loops
      • For Loop
      • Exercises
    • Java Type Casting
    • Strings
      • Common String Practices
      • String Formatting
      • Java Special Characters
    • Collection
      • Arrays
      • For Each Loop
      • ArrayList
      • Exercise Questions
    • Static Methods
      • (Aside) Clearing your Console
    • Randomness in Java
    • Delayed Output in Java
    • Java Output Formatting
    • Java Style Guide
  • 🛠️JavaScript Programming
    • Our Programming Editor & Workflow
      • Hello, world!
      • Commenting & Variables
      • Data in JavaScript
      • Operators
      • String Formatting
      • Getting User Input
    • JavaScript Exercise Set 1
    • Making Decisions
      • Comparing Values
      • Combining Boolean Comparisons
      • Creating Branches
    • JavaScript Exercise Set 2
    • While Loops
      • Infinite While Loop
      • While Loops and Numbers
      • While Loops and Flags
      • While loops w/ Strings
    • JavaScript Exercise Set 3
    • Subprograms & Functions
      • Creating a Function in JavaScript
      • Function with Input and Assignable Output
    • JavaScript Exercise Set 4
  • 💾Topics in CS
    • Computer Environments & Systems
      • Computer Components
        • In-depth Explanations
      • File Maintenance
      • Computer & Safety
      • Software Development
      • Bits & Binary
    • Careers related to Computer Science
    • Postsecondary Opportunities
Powered by GitBook
On this page
  • Normal Forms
  • Normalizing a Table Example
  • Example Table that is 2NF but not 3NF
  1. Python & Databases
  2. Intro to Databases & Data Modeling

Database Normalization

Database normalization is the process of organizing the fields and tables of a relational database to minimize redundancy and dependency.

The goal is to ensure that each piece of data is stored only once, which helps maintain data integrity and makes the database more efficient.

Reducing Data Redundancy

Normalization helps eliminate duplicate data by ensuring that each piece of information is stored in only one place. This reduces the amount of storage required and prevents inconsistencies that can arise when the same data is updated in multiple places.

Improving Data Integrity

By organizing data into related tables and defining relationships between them, normalization ensures that data remains accurate and consistent. For example, if a customer's address is stored in one table, any changes to the address need to be made only once, reducing the risk of errors.

Normal Forms

Normalization is typically carried out through a series of steps known as normal forms.

First Normal Form (1NF)

  • Ensures that the table has a primary key and that all columns contain atomic (indivisible) values.

    • There should also be no repeating groups or arrays.

Atomic Values

"identify something as small as possible"

Example of Values that are not atomic:

  • Contact Information: 123-456-7890, john@example.com

This is not atomic because the value in this column called ContactInformation contains multiple values at once: a phone number and an email address.

Their information should have been separated into multiple columns of PhoneNumber and Email

ID
Name
Phone1
Phone2
Phone3

1

John

123-456-7890

234-567-8901

345-678-9012

2

Jane

987-654-3210

NULL

NULL

No Repeating Groups or Arrays

  • In the table above: columns (Phone1, Phone2, Phone3) represent a repeating group because they store similar information

  • For the table to achieve 1NF; create separate table for customers and phone numbers.

Customers Table

ID
Name

1

John

2

Jane

PhoneNumbers Table

ID
PhoneNumber

1

123-456-7890

1

234-567-8901

1

345-678-9012

2

987-654-3210

Second Normal Form (2NF)

  • Achieved when the table is in 1NF and all non-key columns are fully dependent on the primary key.

Third Normal Form (3NF)

  • Achieved when the table is in 2NF and all columns are dependent only on the primary key, not on other non-key columns.

Normalizing a Table Example

OrderID
Name
ItemsOrdered
TotalAmount

1

John Doe

Apple, Oranges

15

2

Jane Smith

Bananas, Grapes, Apples

20

3

Bob Johnson

Oranges

5

1NF -> Make sure that each column contains atomic values and no repeating groups

Result:

OrderID
Name
ItemOrdered
TotalAmount

1

John Doe

Apples

15

1

John Doe

Oranges

15

2

Jane Smith

Bananas

20

2

Jane Smith

Grapes

20

2

Jane Smith

Apples

20

3

Bob Johnson

Oranges

5

2NF -> Remove Partial Dependencies

  • non-key attributes should depend on primary key

    • Make TotalAmount depend on OrderID

    • Split the table into Orders(OrderID, Name, TotalAmount) and OrderItems(OrderID, ItemOrdered)

Orders Table

OrderID
Name
TotalAmount

1

John Doe

15

2

Jane Smith

20

3

Bob Johnson

5

OrderItems Table

OrderID
ItemOrdered

1

Apples

1

Oranges

2

Bananas

2

Grapes

2

Apples

3

Oranges

3NF -> Remove Transitive dependencies

  • Non-key attributes should not depend on other non-key attributes.

    • Both tables: Orders and OrderItem are already 3NF.

Example Table that is 2NF but not 3NF

StudentID
CourseID
CourseName
InstructorID
InstructorName

1

101

Math

1001

Dr. Smith

1

102

History

1002

Dr. Brown

2

101

Math

1001

Dr. Smith

2

103

Science

1003

Dr. White

1NF Properties:

  • The table is atomic because each attribute in a record holds one value

  • There are no repetitive columns

2NF Properties

  • All non-key columns are dependent on composite primary keys (StudentID, CourseID)

This table fails to be 3NF because InstructorName depends on InstructorID not being dependent on both StudentID and CourseID

To normalize this table to 3NF, we need to remove the transitive dependency by creating a separate table for instructors:

StudentsCourses Table:

StudentID
CourseID
CourseName
InstructorID

1

101

Math

1001

1

102

History

1002

2

101

Math

1001

2

103

Science

1003

Instructors Table:

InstructorID
InstructorName

1001

Dr. Smith

1002

Dr. Brown

1003

Dr. White

PreviousPrimary Keys and Foreign KeysNextWhat is SQL?

Last updated 5 months ago

💾