Course Content
Module 1 – Getting Started with Python
introduced the fundamentals of Python, giving beginners a clear understanding of how the language works and how to start writing simple programs. Python was highlighted as a beginner-friendly language with simple syntax, making it easy to read and write code.
0/7
Module 2 – Introduction to Python Programming
In this Introduction to Python module, learners explore Python’s clear, readable syntax and powerful features. Beginning with installation and a simple “Hello, World!” script, you will progress through variables, control flow and functions using step-by-step examples. By the end, you will be equipped to write your own Python programmes, automate routine tasks and tap into an extensive library ecosystem for real-world projects.
0/7
Basic Command for Command prompt, PowerShell, Zsh(macOS)
0/1
Module 3 – Variables, Data Types and Basic Operations
In the Variables, Data Types and Basic Operations in Python module, learners explore how to store and manage data using variables, master fundamental types such as integers, floats, strings and booleans, and perform arithmetic, comparison and logical operations step by step. Clear explanations, real world examples and hands on exercises guide you through writing and debugging code. By the end of this module, you will be ready to build dynamic Python programs and automate everyday tasks.
0/6
Module 4 – Control Flow – Conditions and Loops
Control flow structures determine the order in which your program’s code executes. With conditional statements, you can make decisions and execute certain code blocks only when specific conditions are met. Loops allow you to repeat actions efficiently without writing redundant code. In this module, we will explore fundamental control flow concepts in Python in a step-by-step manner, similar to Microsoft’s learning curriculum. By the end, you’ll understand how to use if, elif, and else statements (including nested conditions) for decision-making, how truthy and falsy values work in Boolean logic, how to construct for loops (using range() and iterating over collections), how to use while loops along with loop control statements (break and continue), and how to leverage list comprehensions and generator expressions for concise looping. Finally, we’ll apply these concepts in a practical exercise to build an interactive decision-making system. Each section below includes explanations, code examples, and mini-exercises to reinforce the concepts, all formatted for clarity and easy follow-along.
0/8
Day 1 Summary
We covered Modules 1, 2 & Module 3 (Lesson 1 & 2)
0/1
Module 5 – Functions and Code Organisation
Imagine you need to clean up a messy data set or send a personalised email to each customer. Instead of writing the same steps over and over, you can create a function and call it whenever you need. In this lesson on Functions and Code Organisation, you will learn how to define functions, pass and return information, document your work and group related code into modules for easy reuse and maintenance.
0/10
Day 2 Summary
Summary for Day 21 Aug 2025
0/1
Day 3 Summary
Summary of Day 28 Aug 2025
0/1
Module 7 – Working with Files and Folders
In this lesson, we will learn how to manipulate files and directories using Python. We’ll explore common file operations using the os module, and see how the pathlib module provides an object-oriented way to handle file paths. We’ll also use the glob module for pattern-based file searches and learn file I/O operations for text, CSV, and binary files. Additionally, we’ll introduce the calendar and time modules to work with dates and timestamps. Finally, an interactive lab will tie everything together by automating a folder backup and cleanup task. Follow the step-by-step sections below for each subtopic, try out the code examples, and explore the guided lab at the end.
0/9
Module 8 – Error Handling and Debugging Techniques
In this lesson, we will learn how to handle errors in Python programs and how to debug code effectively. Errors are inevitable, but knowing how to manage them ensures our programs don't crash unexpectedly. We will cover the difference between syntax errors and exceptions, how to use try, except, else, and finally blocks to catch and handle exceptions, and how to raise your own exceptions (including creating custom exception classes). We’ll also explore debugging strategies: using simple print statements or the logging module to trace your program’s execution, and using Python’s interactive debugger pdb to step through code. By following best practices for error handling and debugging, you can write resilient, maintainable code. Throughout this lesson, try the examples and exercises to practice these techniques.
0/9
Day 4 Summary
0/1
Module 9 – Automating Excel and PDFs with Python
In this lesson, you will learn how to automate common communication and reporting tasks using Python. We will cover sending notifications via email, messaging platforms, and SMS, as well as manipulating Excel spreadsheets and PDF files programmatically. Each section below includes step-by-step explanations, code examples, and interactive exercises to reinforce your understanding. By the end of this lesson, you’ll be able to send emails with attachments, integrate with Slack/Microsoft Teams, send SMS alerts, and automate Excel/PDF workflows.
0/9
Day 5 Summary
0/1
Mini Project: Build your own Automation Tool
The project incorporates two common automation tasks – Contact Management and Student Tasks Tracking
0/2
Day 6 Summary
0/1
Introduction to Python Programming (Copy 1)

Reading Text from PDF Files in Python

Learning Objective

By the end of this lab, you will understand how to:

  • Open a PDF file in Python

  • Extract text from its pages

  • Display part of the extracted text

 Reading PDF Text with PyPDF2

Python has a special library called PyPDF2 which allows us to read PDF files.

First thing that we need is to add this module to your program

pip install PyPDF2

In this lesson, we will write a program that extracts text from a PDF.

Code: read_pdf_text.py

# Import the PdfReader class from the PyPDF2 library
from PyPDF2 import PdfReader

def read_pdf_text(pdf_path):
    """
    Extract text from all pages of a PDF file.
    
    Args:
        pdf_path (str): Path to the PDF file
    
    Prints:
        First 500 characters of the extracted text
    """
    
    # Open the PDF file in read-binary mode
    with open(pdf_path, 'rb') as file:
        reader = PdfReader(file)  # Create a PDF reader object
        text = ""  # Store extracted text here

        # Loop through each page in the PDF
        for page in reader.pages:
            text += page.extract_text() + "nn"  # Add page text with spacing

        # Print a preview of the extracted text
        print(f"Extracted text from {pdf_path}:n")
        print(text[:500] + "...")  # Print only the first 500 characters

# Example usage
read_pdf_text("sample.pdf")

Step-by-Step:

  1. Importing the Library

    from PyPDF2 import PdfReader
    
    • PdfReader is a tool from the PyPDF2 library that helps us read PDF files.

  2. Defining a Function

    def read_pdf_text(pdf_path):
    
    • This function takes the file path (location of the PDF) and extracts its text.

  3. Opening the File

    with open(pdf_path, 'rb') as file:
    
    • 'rb' means read in binary mode (needed for PDF files).

    • with open(...) ensures the file closes automatically after use.

  4. Creating a PDF Reader Object

    reader = PdfReader(file)
    
    • PdfReader helps us access the content of the PDF.

  5. Extracting Text

    for page in reader.pages:
        text += page.extract_text() + "nn"
    
    • Loops through each page.

    • page.extract_text() pulls the text from that page.

    • "nn" adds some spacing between pages.

  6. Previewing the Extracted Text

    print(text[:500] + "...")
    
    • Shows only the first 500 characters so the output doesn’t become too long.

  7. Running the Program

    read_pdf_text("sample.pdf")
    
    • Calls the function and tries to read a file named sample.pdf.

Example Output

If your PDF has text, the program will show something like:

Extracted text from sample.pdf:

This is the first page of the PDF...
It contains text that is now extracted using Python...

Key Points

  • Use PyPDF2 to read PDF files.

  • Always open PDF files in 'rb' (read-binary) mode.

  • You can loop through pages and extract their text.

  • Print a preview instead of the entire content for large PDFs.

 

Exercise Files
read_pdf_text.zip
Size: 634.00 B