Competitive Coding

Unit 2: Primality Testing

From naive checks to blazing-fast sieves — master every primality testing algorithm, crack competitive coding problems, and understand the cryptography that secures India's digital payments.

⏱️ 6 hrs theory + 4 hrs practice | 💰 Earning Potential: ₹5,000–₹25,000/month | 📝 30 MCQs (Bloom's Mapped)

💼 Jobs this unlocks: Cryptography Engineer (₹8–15 LPA) | Security Developer (₹6–12 LPA) | Competitive Programmer

Section A

Opening Hook — The Invisible Math Guarding Your Money

🔐 Every UPI Payment You Make is Secured by Prime Numbers

Open PhonePe or Google Pay right now. Send ₹1 to a friend. In that fraction of a second, your phone performed RSA encryption — a cryptographic algorithm that relies entirely on the difficulty of factoring the product of two very large prime numbers (each 300+ digits long).

India's UPI processed 13.89 billion transactions worth ₹20.64 lakh crore in May 2024 alone. Every single one of those transactions relied on keys whose generation requires primality testing. The NPCI (National Payments Corporation of India) uses TLS certificates backed by RSA-2048, whose 617-digit modulus is the product of two primes of about 309 digits each. If someone could efficiently factor such a modulus, they could intercept every payment on PhonePe, Google Pay, Paytm, and BHIM.

What if YOU understood how this works? What if you could implement the same algorithms that protect billions of rupees? That's exactly what this chapter teaches you — from the simplest O(n) check to the blazing-fast Sieve of Eratosthenes used in competitive programming.

🇮🇳 PhonePe | 🇮🇳 Google Pay | 🇮🇳 Razorpay | 🇮🇳 Paytm | 🇮🇳 NPCI | 🇮🇳 BHIM

RSA-2048 uses a 617-digit modulus built from two primes of about 309 digits each. To crack it by brute force, you'd need more time than the age of the universe — even with every computer on Earth working together. The security of ₹20+ lakh crore in monthly UPI transactions rests on one simple mathematical fact: multiplying two primes is easy, but factoring their product is astronomically hard.

Section B

Learning Outcomes — Bloom's Taxonomy Mapped

Bloom's Level	Learning Outcome
🔵 Remember	Define prime numbers and list their fundamental properties (infinitude, distribution, fundamental theorem of arithmetic)
🔵 Remember	State Fermat's Little Theorem and recall the first 25 prime numbers
🟢 Understand	Explain why checking divisibility only up to √n is sufficient for primality testing, with mathematical proof
🟢 Understand	Describe the Sieve of Eratosthenes algorithm step-by-step and trace it for any given n
🟡 Apply	Implement the O(√n) primality test in both C++ and Python with edge-case handling
🟡 Apply	Code the Sieve of Eratosthenes to generate all primes up to N = 10⁷
🟠 Analyze	Compare time complexities of naive O(n), O(√n), Fermat, and Sieve methods for different input ranges
🟠 Analyze	Analyze why Carmichael numbers are problematic for the Fermat primality test and identify examples
🔴 Evaluate	Assess when to use deterministic vs probabilistic primality tests based on problem constraints
🔴 Evaluate	Evaluate Sieve of Eratosthenes vs Segmented Sieve trade-offs for different memory and range requirements
🟣 Create	Design a comprehensive prime-testing library that combines multiple algorithms with automatic method selection
🟣 Create	Build a segmented sieve for arbitrary range [L, R] queries where R can be up to 10⁹

Section C

Concept Explanation — Primality Testing from Scratch

1. Introduction to Primality Testing

What is a prime number? A prime number is a natural number greater than 1 that has no positive divisors other than 1 and itself. The first few primes are: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47.

Note that 2 is the only even prime. Every even number greater than 2 is divisible by 2, so it cannot be prime. Also, 1 is NOT prime by convention (it has only one divisor, not two).

🔢 Why Do Primes Matter?

1. Cryptography (RSA): RSA encryption relies on the product of two large primes. Multiplying them is easy (milliseconds), but factoring the result takes billions of years. This asymmetry secures all online banking, UPI payments, and HTTPS websites.

2. Hash Tables: Using prime numbers as hash table sizes reduces collisions. Java's String.hashCode, for example, uses the prime 31 as its multiplier.

3. Competitive Programming: Primality testing and prime-based techniques appear regularly in competitive coding problems. Platforms like Codeforces, CodeChef, and LeetCode frequently feature prime-based problems.

4. Random Number Generation: Many pseudorandom number generators (PRNGs) use prime moduli for better distribution.

Naive O(n) Primality Test

The simplest approach: check if any number from 2 to n-1 divides n. If yes, n is not prime.

C++
// Naive primality test — O(n) time complexity
#include <iostream>
using namespace std;

bool isPrime(int n) {
    if (n <= 1) return false;
    for (int i = 2; i < n; i++) {
        if (n % i == 0) return false;
    }
    return true;
}

int main() {
    int n;
    cout << "Enter a number: ";
    cin >> n;
    if (isPrime(n))
        cout << n << " is PRIME" << endl;
    else
        cout << n << " is NOT prime" << endl;
    return 0;
}

Python
# Naive primality test — O(n) time complexity
def is_prime(n):
    if n <= 1:
        return False
    for i in range(2, n):
        if n % i == 0:
            return False
    return True

n = int(input("Enter a number: "))
if is_prime(n):
    print(f"{n} is PRIME")
else:
    print(f"{n} is NOT prime")

This O(n) approach is WAY too slow for large numbers. For n = 10⁹ (1 billion), this loop runs 1 billion times. At ~10⁸ operations/second, that's 10 seconds — far too slow for competitive coding where time limits are typically 1-2 seconds.

2. O(√n) Optimized Primality Test

Key Insight: If n = a × b where both a and b are greater than √n, then a × b > √n × √n = n, which is a contradiction. Therefore, if n has any factor other than 1 and itself, at least one factor must be ≤ √n.

📐 Mathematical Proof: Why √n is Sufficient

Theorem: If n is composite, then n has a prime factor p where p ≤ √n.

Proof: Let n be composite. Then n = a × b where 1 < a ≤ b < n. Since a ≤ b, we have a × a ≤ a × b = n, therefore a ≤ √n. Since a > 1 and a divides n, a has a prime factor p that also divides n, and p ≤ a ≤ √n. ∎

Practical impact: For n = 10⁹, instead of checking 10⁹ divisors, we check only √(10⁹) ≈ 31,623 divisors. That's a 31,600× speedup!
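The theorem can also be checked by brute force for small n. The sketch below is only an illustration: the helper names `smallest_factor` and `theorem_holds_up_to`, and the bound 5,000, are assumptions introduced here and are not part of the original material.

```python
import math

def smallest_factor(n):
    """Return the smallest divisor of n greater than 1 (n itself when n is prime)."""
    for d in range(2, n + 1):
        if n % d == 0:
            return d

def theorem_holds_up_to(limit):
    """Check that every composite n <= limit has a factor d with d * d <= n."""
    for n in range(4, limit + 1):
        d = smallest_factor(n)
        if d != n and d * d > n:   # a composite whose smallest factor exceeds sqrt(n)
            return False
    return True

print(theorem_holds_up_to(5_000))   # True
print(math.isqrt(10**9))            # 31622, the largest divisor worth trying for n = 10^9
```

Because the smallest factor of a composite number is always prime, this check exercises exactly the statement proved above.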

C++
// O(√n) primality test — optimized
#include <iostream>
#include <cmath>
using namespace std;

bool isPrime(int n) {
    if (n <= 1) return false;
    if (n <= 3) return true;
    if (n % 2 == 0 || n % 3 == 0) return false;
    // long long keeps i * i from overflowing when n is close to INT_MAX
    for (long long i = 5; i * i <= n; i += 6) {
        if (n % i == 0 || n % (i + 2) == 0)
            return false;
    }
    return true;
}

int main() {
    int n;
    cout << "Enter number: ";
    cin >> n;
    cout << n << (isPrime(n) ? " is PRIME" : " is NOT prime") << endl;
    return 0;
}

Python
# O(√n) primality test — optimized with 6k±1 trick
def is_prime(n):
    if n <= 1: return False
    if n <= 3: return True
    if n % 2 == 0 or n % 3 == 0: return False
    i = 5
    while i * i <= n:
        if n % i == 0 or n % (i + 2) == 0:
            return False
        i += 6
    return True

The 6k±1 optimization: All primes greater than 3 are of the form 6k±1. Why? Because any integer can be written as 6k, 6k+1, 6k+2, 6k+3, 6k+4, or 6k+5. Of these, 6k is divisible by 6, 6k+2 and 6k+4 by 2, and 6k+3 by 3. So only 6k+1 and 6k+5 (= 6(k+1)-1) can be prime. This lets us skip 2/3 of all candidates!
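The residue argument above can be confirmed empirically. The following is a minimal sketch, assuming a hypothetical helper `residues_mod_6` and an arbitrary bound of 100,000; `is_prime` repeats the 6k±1 logic of the listing above so that the block runs on its own.

```python
def is_prime(n):
    """Trial division using the 6k±1 step."""
    if n <= 1:
        return False
    if n <= 3:
        return True
    if n % 2 == 0 or n % 3 == 0:
        return False
    i = 5
    while i * i <= n:
        if n % i == 0 or n % (i + 2) == 0:
            return False
        i += 6
    return True

def residues_mod_6(limit):
    """Collect p % 6 for every prime p with 3 < p <= limit."""
    return {p % 6 for p in range(5, limit + 1) if is_prime(p)}

print(sorted(residues_mod_6(100_000)))   # [1, 5]
```

Only the residues 1 and 5 appear, which is why the loop can test i and i + 2 and then jump ahead by 6.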

3. Factorization of a Number

Trial division finds all prime factors by repeatedly dividing by the smallest possible factor. The optimized approach runs in O(√n) time.

C++
// Find ALL prime factors of n — O(√n)
#include <iostream>
#include <vector>
using namespace std;

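vector<int> primeFactors(int n) {
    vector<int> factors;
    if (n < 2) return factors;   // 0 and 1 have no prime factors; n = 0 would otherwise loop forever
    while (n % 2 == 0) {         // strip all factors of 2 first
        factors.push_back(2);
        n /= 2;
    }
    // i is long long so that i * i cannot overflow when n is close to INT_MAX
    for (long long i = 3; i * i <= n; i += 2) {
        while (n % i == 0) {
            factors.push_back((int)i);
            n /= i;
        }
    }
    if (n > 1) factors.push_back(n);   // the remainder is a prime larger than sqrt of what was left
    return factors;
}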

int main() {
    int n = 84;
    auto f = primeFactors(n);
    cout << "Prime factors of " << n << ": ";
    for (int x : f) cout << x << " ";
    return 0;
}

Python
# Find ALL prime factors of n — O(√n)
def prime_factors(n):
    factors = []
    if n < 2:          # 0 and 1 have no prime factors (n = 0 would loop forever below)
        return factors
    while n % 2 == 0:
        factors.append(2)
        n //= 2
    i = 3
    while i * i <= n:
        while n % i == 0:
            factors.append(i)
            n //= i
        i += 2
    if n > 1:
        factors.append(n)
    return factors

print(prime_factors(84))  # [2, 2, 3, 7]

4. Finding Prime Factors by Taking Square Root

📝 Worked Example: n = 84

Step 1: 84 ÷ 2 = 42 → Factor found: 2

Step 2: 42 ÷ 2 = 21 → Factor found: 2

Step 3: 21 ÷ 3 = 7 → Factor found: 3

Step 4: 7 > 1 and √7 ≈ 2.6 < 3 (next candidate), so 7 is prime → Factor found: 7

Result: 84 = 2² × 3¹ × 7¹

Prime factors of 84: 2 2 3 7

84 = 2^2 × 3^1 × 7^1
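The exponent form in the worked example can be produced directly by counting how often each prime divides n. This is a minimal sketch; the function names `factorize` and `exponent_form` are illustrative choices, not names used elsewhere in this unit.

```python
def factorize(n):
    """Return {prime: exponent} for n >= 2 using trial division up to sqrt(n)."""
    exps = {}
    d = 2
    while d * d <= n:
        while n % d == 0:
            exps[d] = exps.get(d, 0) + 1
            n //= d
        d += 1 if d == 2 else 2    # after 2, try odd candidates only
    if n > 1:                      # the leftover has no divisor <= its square root, so it is prime
        exps[n] = exps.get(n, 0) + 1
    return exps

def exponent_form(n):
    """Format the factorization as 'p^e × q^f × ...'."""
    return " × ".join(f"{p}^{e}" for p, e in sorted(factorize(n).items()))

print(exponent_form(84))    # 2^2 × 3^1 × 7^1
print(exponent_form(360))   # 2^3 × 3^2 × 5^1
```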

5. Fermat Primality Test

🔬 Fermat's Little Theorem

Statement: If p is a prime number and a is any integer not divisible by p, then:

a^(p−1) ≡ 1 (mod p)

Example: Let p = 7, a = 2. Then 2⁶ = 64. And 64 mod 7 = 1. ✓ Confirmed!

Contrapositive: If aⁿ⁻¹ ≢ 1 (mod n), then n is definitely composite.

The Test: Pick random values of a. If any gives aⁿ⁻¹ ≢ 1 (mod n), n is composite. If all pass, n is probably prime.
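Before looking at the randomized version, it can help to check the theorem and its contrapositive exhaustively on tiny inputs. The sketch below uses Python's built-in three-argument `pow` for modular exponentiation; the helper names `fermat_holds` and `fermat_witnesses` are assumptions made for this illustration.

```python
def fermat_holds(p):
    """True if a^(p-1) ≡ 1 (mod p) for every a in 1..p-1."""
    return all(pow(a, p - 1, p) == 1 for a in range(1, p))

def fermat_witnesses(n):
    """Bases a in 2..n-2 with a^(n-1) mod n != 1; any one of them proves n is composite."""
    return [a for a in range(2, n - 1) if pow(a, n - 1, n) != 1]

print(fermat_holds(7))             # True
print(pow(2, 6, 7))                # 1
print(len(fermat_witnesses(15)))   # 10 of the 12 bases in 2..13 expose 15
print(fermat_witnesses(7))         # [] because 7 is prime
```

For n = 15 the two bases that do not expose it are 4 and 11, which shows why a single passing base is not enough to conclude that n is prime.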

C++
// Fermat Primality Test — Probabilistic
#include <iostream>
#include <cstdlib>
using namespace std;

long long power(long long base, long long exp, long long mod) {
    long long result = 1;
    base %= mod;
    while (exp > 0) {
        if (exp % 2 == 1)
            result = (result * base) % mod;
        exp /= 2;
        base = (base * base) % mod;
    }
    return result;
}

bool fermatTest(int n, int k) {
    if (n <= 1) return false;
    if (n <= 3) return true;
    for (int i = 0; i < k; i++) {
        int a = 2 + rand() % (n - 3);
        if (power(a, n - 1, n) != 1)
            return false;
    }
    return true;
}

Python
# Fermat Primality Test — Probabilistic
import random

def fermat_test(n, k=10):
    if n <= 1: return False
    if n <= 3: return True
    for _ in range(k):
        a = random.randint(2, n - 2)
        if pow(a, n - 1, n) != 1:
            return False
    return True

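print(fermat_test(7))    # True
print(fermat_test(561))  # Usually False: a random base divisible by 3, 11 or 17 exposes 561
print(pow(2, 560, 561))  # 1 -> base 2 alone would wrongly report "probably prime" (Carmichael number)
print(fermat_test(15))   # False (with overwhelming probability)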

Beware of Carmichael numbers! These are composite numbers that pass the Fermat test for ALL bases coprime to them. Examples: 561 = 3×11×17, 1105 = 5×13×17, 1729 = 7×13×19 (also the Hardy-Ramanujan number!). Whenever every randomly chosen base happens to be coprime to such a number, the Fermat test will say it is "prime", but it is not. For competitive coding, prefer the deterministic O(√n) test or Miller-Rabin.
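Since Miller-Rabin is recommended here but not shown, the following is a hedged sketch of it rather than a reference implementation. The fixed base set (the first twelve primes) is an assumption taken from commonly cited results that it is sufficient for all n below 2⁶⁴; the function names and the example 56052361 = 211 × 421 × 631 (a Carmichael number) are chosen for illustration.

```python
from math import gcd

def fermat_pass_counts(n):
    """Return (coprime bases that pass the Fermat test, number of coprime bases) for 2..n-2."""
    coprime = [a for a in range(2, n - 1) if gcd(a, n) == 1]
    passing = [a for a in coprime if pow(a, n - 1, n) == 1]
    return len(passing), len(coprime)

# Assumption: these fixed bases are reported to be sufficient for every n < 2^64.
BASES = (2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37)

def miller_rabin(n):
    """Miller-Rabin with fixed bases; intended for n < 2^64."""
    if n < 2:
        return False
    for p in BASES:                 # quick trial division by the base primes
        if n % p == 0:
            return n == p
    d, s = n - 1, 0
    while d % 2 == 0:               # write n - 1 = d * 2^s with d odd
        d //= 2
        s += 1
    for a in BASES:
        x = pow(a, d, n)
        if x == 1 or x == n - 1:
            continue
        for _ in range(s - 1):
            x = x * x % n
            if x == n - 1:
                break
        else:
            return False            # a is a witness: n is composite
    return True

print(fermat_pass_counts(561))               # (318, 318): every coprime base passes
print(pow(3, 560, 561) == 1)                 # False: 3 shares a factor with 561
print(pow(2, 56052360, 56052361) == 1)       # True: Fermat is fooled by base 2
print(miller_rabin(56052361))                # False
print(miller_rabin(1000000007))              # True
```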

6. Sieve of Eratosthenes ⭐ MOST IMPORTANT

The Sieve of Eratosthenes is the single most important algorithm in this entire unit. It generates ALL prime numbers up to n in O(n log log n) time. If you learn only one thing from this chapter, learn this.

Think of the Sieve like filtering wheat at a kirana (grocery) store. You start with a big pile of grains (numbers 2 to n). You pass it through a sieve — the first sieve removes all chaff divisible by 2, the next removes chaff divisible by 3, then 5, and so on. What's left after all the filtering? Only the pure wheat — the prime numbers!

📊 Algorithm Step-by-Step

Step 1: Create a boolean array is_prime[0..n], initialized to true.

Step 2: Mark is_prime[0] and is_prime[1] as false (not prime).

Step 3: Start with p = 2 (first prime).

Step 4: Mark all multiples of p (starting from p²) as false.

Step 5: Find the next unmarked number — that's the next prime.

Step 6: Repeat until p² > n.

Step 7: All remaining true entries are primes!

ASCII Visualization: Sieve for n = 30

Step	Numbers 2–30 (✓ = prime candidate, ✗ = crossed out)
Initial	✓2 ✓3 ✓4 ✓5 ✓6 ✓7 ✓8 ✓9 ✓10 ✓11 ✓12 ✓13 ✓14 ✓15 ✓16 ✓17 ✓18 ✓19 ✓20 ✓21 ✓22 ✓23 ✓24 ✓25 ✓26 ✓27 ✓28 ✓29 ✓30
p=2 Cross 4,6,8…30	✓2 ✓3 ✗4 ✓5 ✗6 ✓7 ✗8 ✓9 ✗10 ✓11 ✗12 ✓13 ✗14 ✓15 ✗16 ✓17 ✗18 ✓19 ✗20 ✓21 ✗22 ✓23 ✗24 ✓25 ✗26 ✓27 ✗28 ✓29 ✗30
p=3 Cross 9,15,21,27	✓2 ✓3 ✗4 ✓5 ✗6 ✓7 ✗8 ✗9 ✗10 ✓11 ✗12 ✓13 ✗14 ✗15 ✗16 ✓17 ✗18 ✓19 ✗20 ✗21 ✗22 ✓23 ✗24 ✓25 ✗26 ✗27 ✗28 ✓29 ✗30
p=5 Cross 25	✓2 ✓3 ✗4 ✓5 ✗6 ✓7 ✗8 ✗9 ✗10 ✓11 ✗12 ✓13 ✗14 ✗15 ✗16 ✓17 ✗18 ✓19 ✗20 ✗21 ✗22 ✓23 ✗24 ✗25 ✗26 ✗27 ✗28 ✓29 ✗30
Done! √30 ≈ 5.47	Primes: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29 (10 primes found)

C++
// Sieve of Eratosthenes — O(n log log n)
#include <iostream>
#include <vector>
using namespace std;

vector<int> sieve(int n) {
    vector<bool> is_prime(n + 1, true);
    is_prime[0] = is_prime[1] = false;
    for (int p = 2; p * p <= n; p++) {
        if (is_prime[p]) {
            for (int j = p * p; j <= n; j += p)
                is_prime[j] = false;
        }
    }
    vector<int> primes;
    for (int i = 2; i <= n; i++)
        if (is_prime[i]) primes.push_back(i);
    return primes;
}

int main() {
    auto primes = sieve(30);
    cout << "Primes up to 30: ";
    for (int p : primes) cout << p << " ";
    return 0;
}

Python
# Sieve of Eratosthenes — O(n log log n)
def sieve(n):
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= n:
        if is_prime[p]:
            for j in range(p * p, n + 1, p):
                is_prime[j] = False
        p += 1
    return [i for i in range(2, n + 1) if is_prime[i]]

print(sieve(30))
# [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]

Primes up to 30: 2 3 5 7 11 13 17 19 23 29

⏱️ Time Complexity: O(n log log n)

The total work done is at most: n/2 + n/3 + n/5 + n/7 + n/11 + ... (one term per prime p; the sieve only uses p ≤ √n, so summing over all primes p ≤ n gives an upper bound).

This equals n × Σ(1/p) for primes p ≤ n. By Mertens' theorem, this sum ≈ log(log n).

So total operations ≈ O(n log log n). For n = 10⁷, log log n ≈ 3, so it's essentially O(3n) — nearly linear!

Space complexity: O(n) for the boolean array.

Why start marking from p² instead of 2p? Because all composites smaller than p² that are multiples of p have already been marked by smaller primes. For example, when p=5, the composite 10 = 2×5 was already marked when p=2, and 15 = 3×5 was already marked when p=3. The first unmarked multiple of 5 is 5² = 25.
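The "nearly linear" claim can be observed by counting how many times the inner loop actually writes to the array. This is a rough sketch: the function name `sieve_mark_count` and the chosen values of n are assumptions for the experiment, and the printed ratio is only meant to be compared informally with ln(ln n).

```python
import math

def sieve_mark_count(n):
    """Run the sieve and return (number of inner-loop writes, number of primes <= n)."""
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    marks = 0
    p = 2
    while p * p <= n:
        if is_prime[p]:
            for j in range(p * p, n + 1, p):   # start at p*p, as explained above
                is_prime[j] = False
                marks += 1
        p += 1
    return marks, sum(is_prime)

for n in (10**3, 10**4, 10**5):
    marks, count = sieve_mark_count(n)
    # columns: n, primes found, writes, writes per element, ln(ln n)
    print(n, count, marks, round(marks / n, 2), round(math.log(math.log(n)), 2))
```

The writes-per-element column grows very slowly as n increases, which is the practical meaning of O(n log log n).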

7. Segmented Sieve

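When n is very large (say 10⁹), a regular sieve needs 10⁹ flags: about 1 GB of RAM at one byte per flag, and still roughly 125 MB when packed one bit per flag (as C++ vector<bool> does). Many competitive programming judges allow only 256 MB, and the time to sweep the whole array grows too. Solution: segmented sieve.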

🔧 How Segmented Sieve Works

Step 1: Generate all primes up to √R using the regular sieve (these "small primes" fit easily in memory).

Step 2: Divide the range [L, R] into segments of size ~√R.

Step 3: For each segment, use the small primes to mark composites.

Step 4: Collect unmarked numbers — those are primes in the range.

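Memory: O(√R) for the small primes plus one segment buffer, instead of O(R). For R = 10⁹, that's about 31,623 flags per array instead of 10⁹! Note that the listings below keep things simple and sieve [L, R] as a single segment, so they use O(√R + (R − L)) memory; that is fine when R − L ≤ 10⁶.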
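The four steps above can be sketched block by block as follows. This is one possible arrangement under stated assumptions: the names `small_primes_up_to` and `primes_in_range`, the generator interface, and the default block size of 32,768 are all choices made for this illustration.

```python
import math

def small_primes_up_to(limit):
    """Step 1: plain sieve for the base primes (all primes <= limit)."""
    mark = [True] * (limit + 1)
    primes = []
    for p in range(2, limit + 1):
        if mark[p]:
            primes.append(p)
            for j in range(p * p, limit + 1, p):
                mark[j] = False
    return primes

def primes_in_range(L, R, block=1 << 15):
    """Yield the primes in [L, R], sieving one block of `block` numbers at a time."""
    L = max(L, 2)                       # 0 and 1 are not prime
    if R < L:
        return
    base = small_primes_up_to(math.isqrt(R))
    for lo in range(L, R + 1, block):   # Step 2: walk over the segments
        hi = min(lo + block - 1, R)
        seg = [True] * (hi - lo + 1)    # only this buffer is alive at any time
        for p in base:                  # Step 3: cross out multiples of small primes
            if p * p > hi:
                break
            start = max(p * p, (lo + p - 1) // p * p)
            for j in range(start, hi + 1, p):
                seg[j - lo] = False
        for i, flag in enumerate(seg):  # Step 4: report what survived
            if flag:
                yield lo + i

print(list(primes_in_range(100, 150)))
# [101, 103, 107, 109, 113, 127, 131, 137, 139, 149]
```

Because results are yielded as each block finishes, the peak memory is the list of base primes plus a single block, independent of R − L.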

C++
// Segmented Sieve — find primes in range [L, R]
#include <iostream>
#include <vector>
#include <cmath>
using namespace std;

vector<long long> segmentedSieve(long long L, long long R) {
    int limit = (int)sqrt((double)R) + 1;
    vector<bool> mark(limit + 1, true);
    vector<int> small_primes;
    for (int p = 2; p <= limit; p++) {
        if (mark[p]) {
            small_primes.push_back(p);
            for (int j = p * p; j <= limit; j += p)
                mark[j] = false;
        }
    }
    vector<bool> is_prime(R - L + 1, true);
    for (int p : small_primes) {
        long long start = max((long long)p * p, ((L + p - 1) / p) * p);
        for (long long j = start; j <= R; j += p)
            is_prime[j - L] = false;
    }
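    // 0 and 1 are not prime; clear them if they fall inside [L, R] (assumes L >= 0)
    if (L <= 0 && 0 <= R) is_prime[0 - L] = false;
    if (L <= 1 && 1 <= R) is_prime[1 - L] = false;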
    vector<long long> result;
    for (long long i = 0; i <= R - L; i++)
        if (is_prime[i]) result.push_back(L + i);
    return result;
}

Python
# Segmented Sieve — find primes in [L, R]
import math

def segmented_sieve(L, R):
    limit = int(math.sqrt(R)) + 1
    # Step 1: small primes via regular sieve
    mark = [True] * (limit + 1)
    small_primes = []
    for p in range(2, limit + 1):
        if mark[p]:
            small_primes.append(p)
            for j in range(p*p, limit+1, p):
                mark[j] = False
    # Step 2: sieve the range [L, R]
    is_prime = [True] * (R - L + 1)
    for p in small_primes:
        start = max(p * p, ((L + p - 1) // p) * p)
        for j in range(start, R + 1, p):
            is_prime[j - L] = False
    for v in (0, 1):               # 0 and 1 are not prime (assumes L >= 0)
        if L <= v <= R:
            is_prime[v - L] = False
    return [L + i for i in range(R - L + 1) if is_prime[i]]

print(segmented_sieve(100, 150))
# [101, 103, 107, 109, 113, 127, 131, 137, 139, 149]

8. Sieve of Atkin

The Sieve of Atkin is a modern algorithm (due to Atkin and Bernstein) that uses quadratic forms and modular arithmetic. It has a theoretical time complexity of O(n / log log n), slightly better than Eratosthenes for very large n.

🔬 Three Quadratic Forms

The algorithm toggles entries based on solutions to three equations:

Form 1: n = 4x² + y², toggled when n mod 12 is 1 or 5 (that is, n ≡ 1 (mod 4) and n is not divisible by 3)

Form 2: n = 3x² + y², toggled when n ≡ 7 (mod 12)

Form 3: n = 3x² − y² with x > y, toggled when n ≡ 11 (mod 12)

After toggling, eliminate all multiples of squares of primes.

When to use: Only for very large n (> 10⁸), where the asymptotic improvement can start to matter. For most competitive programming, Eratosthenes is preferred due to its simplicity and smaller constant factors.

C++
// Sieve of Atkin: overview implementation (simplified mod-12 variant)
#include <iostream>
#include <vector>
#include <cmath>
using namespace std;

vector<int> sieveOfAtkins(int limit) {
    vector<bool> sieve(limit + 1, false);
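    if (limit >= 2) sieve[2] = true;   // 2 and 3 are not produced by the quadratic forms,
    if (limit >= 3) sieve[3] = true;   // so seed them by hand (>= so limit = 2 or 3 works)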
    for (int x = 1; x * x <= limit; x++) {
        for (int y = 1; y * y <= limit; y++) {
            int n = 4 * x * x + y * y;
            if (n <= limit && (n % 12 == 1 || n % 12 == 5))
                sieve[n] = !sieve[n];
            n = 3 * x * x + y * y;
            if (n <= limit && n % 12 == 7)
                sieve[n] = !sieve[n];
            n = 3 * x * x - y * y;
            if (x > y && n <= limit && n % 12 == 11)
                sieve[n] = !sieve[n];
        }
    }
    for (int r = 5; r * r <= limit; r++)
        if (sieve[r])
            for (int i = r * r; i <= limit; i += r * r)
                sieve[i] = false;
    vector<int> primes;
    for (int i = 2; i <= limit; i++)
        if (sieve[i]) primes.push_back(i);
    return primes;
}

9. Mansi and Her Series

Problem: Given N, generate first N terms of a series where: if index i is prime, term[i] = i; otherwise, term[i] = smallest prime factor of i.

C++
// Mansi's Series using Smallest Prime Factor (SPF) sieve
#include <iostream>
#include <vector>
using namespace std;

int main() {
    int N = 10;
    vector<int> spf(N + 1);
    for (int i = 0; i <= N; i++) spf[i] = i;
    for (int i = 2; i * i <= N; i++) {
        if (spf[i] == i) {
            for (int j = i * i; j <= N; j += i)
                if (spf[j] == j) spf[j] = i;
        }
    }
    cout << "Mansi's Series: ";
    for (int i = 1; i <= N; i++)
        cout << spf[i] << " ";
    return 0;
}

Mansi's Series: 1 2 3 2 5 2 7 2 3 2
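The same smallest-prime-factor table also gives fast factorization of any number up to N, which is a common follow-up use. The sketch below is a Python rendering under that assumption; `build_spf` and `factorize_with_spf` are illustrative names, not part of the problem statement.

```python
def build_spf(n):
    """spf[i] = smallest prime factor of i for 2 <= i <= n (spf[0] = 0, spf[1] = 1)."""
    spf = list(range(n + 1))
    i = 2
    while i * i <= n:
        if spf[i] == i:                       # i is prime
            for j in range(i * i, n + 1, i):
                if spf[j] == j:               # not yet claimed by a smaller prime
                    spf[j] = i
        i += 1
    return spf

def factorize_with_spf(x, spf):
    """Peel off the smallest prime factor repeatedly; takes O(log x) steps."""
    factors = []
    while x > 1:
        factors.append(spf[x])
        x //= spf[x]
    return factors

spf = build_spf(100)
print(spf[1:11])                       # [1, 2, 3, 2, 5, 2, 7, 2, 3, 2]
print(factorize_with_spf(84, spf))     # [2, 2, 3, 7]
```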

10. Collections of Pens

Problem: Rohan has N pens. He wants to distribute them equally among groups. Find the total number of ways (= number of divisors of N).

C++
// Count divisors using prime factorization
#include <iostream>
using namespace std;

int countDivisors(int n) {
    int count = 1;
    for (int i = 2; i * i <= n; i++) {
        int exp = 0;
        while (n % i == 0) { exp++; n /= i; }
        count *= (exp + 1);
    }
    if (n > 1) count *= 2;
    return count;
}

int main() {
    int n = 12;
    cout << "N = " << n << ", Ways = " << countDivisors(n) << endl;
    // 12 = 2^2 × 3^1 → (2+1)(1+1) = 6
    return 0;
}

N = 12, Ways = 6 (Groups of: 1, 2, 3, 4, 6, or 12 pens each)
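To see that the (exponent + 1) product really counts divisors, it can be compared against an explicit listing. This is a small sketch; `count_divisors` mirrors the C++ logic above and `divisors` is a hypothetical helper added only for the cross-check.

```python
def count_divisors(n):
    """Divisor count via prime factorization: product of (exponent + 1)."""
    count, d = 1, 2
    while d * d <= n:
        exp = 0
        while n % d == 0:
            exp += 1
            n //= d
        count *= exp + 1
        d += 1
    if n > 1:              # one leftover prime with exponent 1
        count *= 2
    return count

def divisors(n):
    """List all divisors in increasing order by pairing d with n // d."""
    small, large = [], []
    d = 1
    while d * d <= n:
        if n % d == 0:
            small.append(d)
            if d != n // d:
                large.append(n // d)
        d += 1
    return small + large[::-1]

print(count_divisors(12), divisors(12))   # 6 [1, 2, 3, 4, 6, 12]
```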

11. Next Prime Palindrome

C++
// Find next number that is both prime AND palindrome
#include <iostream>
#include <string>
#include <algorithm>
using namespace std;

bool isPrime(int n) {
    if (n <= 1) return false;
    if (n <= 3) return true;
    if (n % 2 == 0 || n % 3 == 0) return false;
    for (int i = 5; i * i <= n; i += 6)
        if (n % i == 0 || n % (i+2) == 0) return false;
    return true;
}

bool isPalindrome(int n) {
    string s = to_string(n);
    string r = s;
    reverse(r.begin(), r.end());
    return s == r;
}

int nextPrimePalindrome(int n) {
    int candidate = n + 1;
    while (true) {
        if (isPalindrome(candidate) && isPrime(candidate))
            return candidate;
        candidate++;
    }
}

int main() {
    cout << "Next prime palindrome after 100: " << nextPrimePalindrome(100) << endl;
    cout << "Next prime palindrome after 7: " << nextPrimePalindrome(7) << endl;
    return 0;
}

Next prime palindrome after 100: 101
Next prime palindrome after 7: 11
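One well-known shortcut, not used in the listing above, is that every palindrome with an even number of digits is divisible by 11, so 11 is the only prime among them. The sketch below applies that observation to skip whole digit ranges; the function names are illustrative and the approach is offered as an optional optimization, not as the intended solution.

```python
def is_prime(n):
    """Trial division using the 6k±1 step."""
    if n <= 1:
        return False
    if n <= 3:
        return True
    if n % 2 == 0 or n % 3 == 0:
        return False
    i = 5
    while i * i <= n:
        if n % i == 0 or n % (i + 2) == 0:
            return False
        i += 6
    return True

def next_prime_palindrome(n):
    """Smallest prime palindrome strictly greater than n."""
    c = n + 1
    while True:
        s = str(c)
        if len(s) % 2 == 0 and c > 11:
            c = 10 ** len(s) + 1      # jump to the first palindrome with one more digit
            continue
        if s == s[::-1] and is_prime(c):
            return c
        c += 1

print(next_prime_palindrome(100))   # 101
print(next_prime_palindrome(7))     # 11
print(next_prime_palindrome(929))   # 10301
```

After 929 the search skips all four-digit candidates in one step, since none of them can be a prime palindrome.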

Section D

Learn by Doing — 3-Tier Lab Structure

🟢 Tier 1 — GUIDED: Implement O(√n) Primality Test

⏱️ 45–60 minutes | Beginner | Step-by-step instructions

Step 1: Handle edge cases

n ≤ 1 → not prime. n = 2 or n = 3 → prime.

Step 2: Check divisibility by 2 and 3

If n % 2 == 0 or n % 3 == 0 → not prime.

Step 3: Check 6k±1 up to √n

Loop i from 5, step 6. Check n % i and n % (i+2).

Step 4: Test with these inputs

Test: 1, 2, 3, 4, 17, 25, 97, 100, 7919, 1000000007

1 → NOT prime
2 → PRIME
3 → PRIME
4 → NOT prime
17 → PRIME
25 → NOT prime
97 → PRIME
100 → NOT prime
7919 → PRIME
1000000007 → PRIME

🟡 Tier 2 — SEMI-GUIDED: Sieve of Eratosthenes

⏱️ 60–90 minutes | Intermediate | Hints provided

Your Mission:

Generate all primes up to 10⁶ and count them.

Hints:

Create a boolean array of size 10⁶ + 1, initialized to true
Mark 0 and 1 as false
For each p from 2 where p² ≤ n, mark multiples from p² as false
Count and collect all remaining true entries

Stretch Goal: Count twin primes (pairs p, p+2 where both are prime) up to 10⁶. Expected answer: 8169 twin prime pairs.

🔴 Tier 3 — OPEN CHALLENGE: Segmented Sieve for [L, R]

⏱️ 90–120 minutes | Advanced | No instructions — real contest problem

The Brief:

Given L and R (up to 10⁹, R − L ≤ 10⁶), find and print all primes in the range [L, R]. Your solution must use O(√R) memory.

Test case: L = 999000000, R = 1000000000 (chosen to respect the stated limit R ≤ 10⁹). How many primes are in this range?

Section E

Problem Set — Practice Questions

Tracing Questions (5)

T1. Trace the Sieve of Eratosthenes for n = 20. Show the array state after each prime's multiples are crossed out.

T2. Trace the O(√n) algorithm for n = 97. List each divisor checked and the final result.

T3. Trace the Fermat test for n = 561 with bases a = 2, 3, 5. Show each computation a^(n-1) mod n.

T4. Trace the prime factorization of n = 360. Show each division step.

T5. Trace the segmented sieve for the range [10, 30] using small primes {2, 3, 5}.

Programming Problems (8)

P1. Count the number of primes up to N. Input: N = 10⁶. Expected output: 78498.

P2. Find the Nth prime number. Input: N = 10001. Expected output: 104743.

P3. Check if a number is a Mersenne prime (of the form 2ᵖ − 1). Test for p = 2, 3, 5, 7, 11, 13.

P4. Find all prime factors of N and print in exponential form. Input: 360 → Output: 2³ × 3² × 5¹.

P5. Generate all twin primes up to N. Twin primes: (3,5), (5,7), (11,13), (17,19), ...

P6. Find the largest prime factor of N. Input: 600851475143. (This is Project Euler Problem 3!)

P7. Count prime palindromes up to N. Input: N = 100. Expected output: 5 (2, 3, 5, 7, 11).

P8. Sum of all primes up to N using Sieve. Input: N = 2×10⁶. Verify your answer.

Industry Application Problems (3)

I1. RSA Key Generation: Given primes p = 61, q = 53, compute n = p×q, φ(n) = (p-1)(q-1), find e coprime to φ(n), compute d = e⁻¹ mod φ(n). Encrypt and decrypt the message m = 65.
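As a way to check your own arithmetic for I1, here is a toy sketch of the computation. The public exponent e = 17 is an assumption (any e coprime to φ(n) is acceptable), and `pow(e, -1, phi)` for the modular inverse needs Python 3.8 or newer. Numbers this small offer no security at all.

```python
from math import gcd

p, q = 61, 53
n = p * q                    # modulus
phi = (p - 1) * (q - 1)      # Euler's totient of n

e = 17                       # assumed public exponent; must be coprime to phi
assert gcd(e, phi) == 1
d = pow(e, -1, phi)          # private exponent: e * d ≡ 1 (mod phi)

m = 65
c = pow(m, e, n)             # encrypt
recovered = pow(c, d, n)     # decrypt

print(n, phi, e, d, c)       # 3233 3120 17 2753 2790
print(recovered)             # 65
```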

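I2. Hash Table Sizing: Write a function that, given N (desired table size), returns the smallest prime ≥ N. Prime bucket counts are used internally by some hash table implementations, such as std::unordered_map in GCC's libstdc++ (Java's HashMap, by contrast, uses power-of-two table sizes).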

I3. Checksum Verification: Implement a prime-based rolling hash for a string. Use a prime modulus (10⁹ + 7) and prime base (31). Verify string integrity after transmission.

Interview Questions (3)

Q1. (Google-style): Given an array of N numbers (each ≤ 10⁶), count how many are prime. Optimize using sieve preprocessing.

Q2. (Amazon-style): Find the k-th prime number efficiently. Can you do better than checking each number?

Q3. (Microsoft-style): Given array A of N integers, count pairs (i, j) where i < j and A[i] + A[j] is prime. Optimize using Goldbach-related observations.

Section F

MCQ Assessment Bank — 30 Questions (Bloom's Mapped)

Remember / Identify (Q1–Q5)

Q1

Which of the following is a prime number?

Remember

✅ Answer: (B) 67 — 51 = 3×17, 91 = 7×13, 87 = 3×29. Only 67 has no factors other than 1 and itself.

Q2

What is the only even prime number?

Remember

✅ Answer: (B) 2 — It is the smallest and only even prime. All other even numbers are divisible by 2.

Q3

Fermat's Little Theorem states that if p is prime, then a^(p-1) mod p equals:

Remember

✅ Answer: (B) 1 — For any a not divisible by p, a^(p-1) ≡ 1 (mod p).

Q4

The time complexity of the Sieve of Eratosthenes is:

O(n²)
O(n log n)
O(n log log n)
O(n√n)

Remember

✅ Answer: (C) O(n log log n) — Due to the harmonic series of prime reciprocals (Mertens' theorem).

Q5

Which of these is a Carmichael number?

Remember

✅ Answer: (B) 561 = 3 × 11 × 17 — The smallest Carmichael number. It passes Fermat's test for all coprime bases despite being composite.

Understand / Explain (Q6–Q10)

Q6

Why is checking divisors only up to √n sufficient for primality testing?

All prime factors are less than √n
If n has a factor greater than √n, it must have a corresponding factor less than √n
√n is always a prime number
It reduces memory usage

Understand

✅ Answer: (B) — If n = a × b and both a, b > √n, then a × b > n, which is a contradiction. So at least one factor must be ≤ √n.

Q7

Why does the Sieve of Eratosthenes start marking from p² instead of 2p?

To save memory
Because all smaller multiples of p have already been marked by smaller primes
p² is always composite
It's an arbitrary optimization

Understand

✅ Answer: (B) — Multiples 2p, 3p, ..., (p-1)p were already marked when processing primes 2, 3, ..., p-1.

Q8

Why is the Fermat test called "probabilistic"?

It uses random number generation internally
It can give false positives (declare composite numbers as prime)
It runs in random time
It only works for random inputs

Understand

✅ Answer: (B) — A composite n can still satisfy aⁿ⁻¹ ≡ 1 (mod n) for some bases, so a pass only means "probably prime"; Carmichael numbers even pass for every base coprime to n. More iterations reduce (but don't eliminate) false positive probability.

Q9

What is the main advantage of a segmented sieve over a regular sieve?

It is faster
It uses significantly less memory
It finds more primes
It works with negative numbers

Understand

✅ Answer: (B) — Regular sieve needs O(n) memory. Segmented sieve needs only O(√n), making it practical for n up to 10⁹ and beyond.

Q10

In the context of RSA encryption, why are large primes essential?

Large primes make the key look impressive
The security depends on the difficulty of factoring the product of two large primes
Small primes cannot be multiplied
Large primes are faster to compute with

Understand

✅ Answer: (B) — RSA security relies on the computational intractability of factoring n = p × q when p and q are very large primes (300+ digits each).

Apply (Q11–Q15)

Q11

What is the output of the Sieve of Eratosthenes for n = 10?

2, 3, 5, 7, 9
2, 3, 5, 7
1, 2, 3, 5, 7
2, 4, 6, 8, 10

Apply

✅ Answer: (B) 2, 3, 5, 7 — These are the four primes ≤ 10. 9 = 3×3 is composite; 1 is not prime by definition.

Q12

The prime factorization of 360 is:

2³ × 3² × 5
2² × 3³ × 5
2³ × 3 × 5²
4 × 9 × 10

Apply

✅ Answer: (A) 2³ × 3² × 5 — 360 = 8 × 45 = 8 × 9 × 5 = 2³ × 3² × 5¹.

Q13

How many prime numbers are between 1 and 30?

Apply

✅ Answer: (B) 10 — Primes: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29.

Q14

For Fermat test with n=7 and a=3, what is 3^6 mod 7?

Apply

✅ Answer: (B) 1 — 3⁶ = 729. 729 ÷ 7 = 104 remainder 1. Since result is 1, the test says 7 is probably prime (and it is!).

Q15

What is the next prime palindrome after 10?

Apply

✅ Answer: (A) 11 — 11 is both prime (no factors other than 1 and 11) and a palindrome (reads same forwards and backwards).

Analyze (Q16–Q20)

Q16

For checking if a single number n is prime, which method is most efficient?

Sieve of Eratosthenes
O(√n) trial division
Naive O(n) check
Segmented sieve

Analyze

✅ Answer: (B) — Among these options, O(√n) trial division is the best fit for a single query. A sieve pays off only when you need to check many numbers up to n.

Q17

Why is 1729 special in number theory?

It is the largest known prime
It is both a Carmichael number and the Hardy-Ramanujan number
It is the only number divisible by all single-digit primes
It is the sum of all primes up to 100

Analyze

✅ Answer: (B) — 1729 = 7 × 13 × 19 (Carmichael number) and is also 1³ + 12³ = 9³ + 10³ (Hardy-Ramanujan: smallest number expressible as sum of two cubes in two ways).

Q18

What happens if we use the Sieve of Eratosthenes for n = 10⁹?

It works perfectly
It runs out of memory (needs ~1 GB for boolean array)
It produces wrong results
It takes exactly 1 second

Analyze

✅ Answer: (B) — A boolean array of 10⁹ elements needs ~1 GB RAM, exceeding typical memory limits. Use segmented sieve instead.

Q19

Compare the number of operations for O(√n) vs Sieve for finding all primes up to n = 10⁶:

√n is faster for this task
Sieve is faster because it processes all numbers at once with O(n log log n)
They take the same time
Neither can handle 10⁶

Analyze

✅ Answer: (B) — Testing each number individually: 10⁶ × √10⁶ = 10⁹ operations. Sieve: ~3 × 10⁶ operations. Sieve is ~333× faster for this task.

Q20

Which data structure is used internally by the Sieve of Eratosthenes?

Linked list
Binary search tree
Boolean array
Hash map

Analyze

✅ Answer: (C) — A boolean array where index i represents whether i is prime. Simple, cache-friendly, and efficient.

Evaluate (Q21–Q25)

Q21

For a competitive coding problem asking "Is N prime?" where N ≤ 10¹⁸, which method is best?

Sieve of Eratosthenes
Naive O(n) check
Miller-Rabin probabilistic test
Segmented sieve

Evaluate

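✅ Answer: (C) — For N up to 10¹⁸, √N ≈ 10⁹ which is too slow for trial division. Miller-Rabin needs only O(k log n) modular multiplications for k bases (about O(k log³ n) bit operations with schoolbook arithmetic) and is the standard choice for very large single-number primality tests.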

Q22

When should you use Sieve of Atkin instead of Eratosthenes?

Always — it is strictly better
Never — Eratosthenes is always better
Only for very large n (>10⁸) where the slight theoretical advantage matters
Only for small n (<100)

Evaluate

✅ Answer: (C) — Atkin has a better asymptotic complexity, O(n/log log n) vs O(n log log n), but larger constant factors. In practice, optimized Eratosthenes is often faster. Atkin is mainly of theoretical interest.

Q23

A student claims "If a number passes the Fermat test with 100 random bases, it is definitely prime." Is this correct?

Yes, 100 bases is more than enough
No, Carmichael numbers pass for all coprime bases, so even 100 tests cannot guarantee primality
Yes, but only for odd numbers
No, the Fermat test never works

Evaluate

✅ Answer: (B) — Carmichael numbers (like 561, 1105, 1729) pass the Fermat test for every coprime base. Only a base that happens to share a factor with n exposes them, so passing 100 rounds is not a proof of primality. Use Miller-Rabin instead.

Q24

For finding primes in range [10⁹, 10⁹+10⁶], the best approach is:

Regular Sieve up to 10⁹
Check each number with O(√n)
Segmented Sieve
Fermat test on each number

Evaluate

✅ Answer: (C) — Regular sieve can't handle 10⁹ (memory). Individual √n tests: 10⁶ × √(10⁹) ≈ 3×10¹⁰ (too slow). Segmented sieve: ~10⁶ operations with O(√10⁹) ≈ 31K memory. Perfect!

Q25

Which property makes prime numbers useful for hash table sizes?

Prime numbers are always odd
Using a prime modulus distributes hash values more uniformly, reducing collisions
Prime numbers are faster to compute
Prime numbers use less memory

Evaluate

✅ Answer: (B) — When the table size is prime, hash(key) % size distributes keys more uniformly because primes have no common factors with typical key patterns, reducing collision clustering.

Create / Design (Q26–Q30)

Q26

To build a function that checks primality for multiple queries where each n ≤ 10⁷, the best preprocessing is:

No preprocessing; check each query with O(√n)
Precompute a sieve up to 10⁷ and answer each query in O(1)
Store all primes in a hash set
Use Fermat test for each query

Create

✅ Answer: (B) — Sieve preprocessing takes O(n log log n) once, then each query is O(1) lookup. Far better than O(√n) per query when there are many queries.
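
One possible shape for this "precompute once, answer in O(1)" pattern is sketched below. The function name build_sieve and the demo limit of 10⁶ (smaller than the question's 10⁷, to keep the run quick) are assumptions for illustration, and the code assumes limit ≥ 1.

```python
import math

def build_sieve(limit):
    """Return a bytearray where entry i is 1 exactly when i is prime (limit >= 1)."""
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            # Clear p*p, p*p + p, ... with a single slice assignment.
            count = len(range(p * p, limit + 1, p))
            is_prime[p * p :: p] = bytes(count)
    return is_prime

LIMIT = 10**6                 # demo value; the question allows up to 10**7
sieve = build_sieve(LIMIT)    # one-time preprocessing

queries = [1, 2, 97, 561, 999_983]
print([bool(sieve[q]) for q in queries])   # each answer is a single array lookup
```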

Q27

To count divisors of N efficiently, the best approach combines:

Checking all numbers from 1 to N
Prime factorization + divisor count formula: product of (exponent+1)
Using Sieve of Eratosthenes
Binary search

Create

✅ Answer: (B) — Find prime factorization N = p₁^a₁ × p₂^a₂ × ... Then divisor count = (a₁+1)(a₂+1)... This runs in O(√N).
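
A minimal sketch of this approach, assuming n ≥ 1 (the function name count_divisors is illustrative):

```python
def count_divisors(n):
    """Count the divisors of n >= 1 from its prime factorization."""
    count = 1
    p = 2
    while p * p <= n:
        exponent = 0
        while n % p == 0:        # pull out every copy of the factor p
            n //= p
            exponent += 1
        count *= exponent + 1    # contributes (a_i + 1) to the product
        p += 1
    if n > 1:                    # one prime factor above the square root remains
        count *= 2
    return count

print(count_divisors(84))   # 84 = 2^2 * 3 * 7, so (2+1)(1+1)(1+1) = 12
```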

Q28

To generate the Smallest Prime Factor (SPF) for all numbers up to N, you should modify the sieve to:

Store the first prime that marks each composite number
Count how many times each number is marked
Use a linked list instead of an array
Run the sieve backwards

Create

✅ Answer: (A) — Initialize SPF[i] = i. When marking composites of prime p, only update SPF[j] = p if SPF[j] == j (meaning j hasn't been marked by a smaller prime yet).
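
The rule above can be sketched as follows. The names build_spf and factorize are illustrative, and the second function shows one common use of the table: factorizing any number up to the limit in O(log n) steps.

```python
def build_spf(limit):
    """spf[i] is the smallest prime factor of i, for 2 <= i <= limit."""
    spf = list(range(limit + 1))          # start with SPF[i] = i
    p = 2
    while p * p <= limit:
        if spf[p] == p:                   # p was never overwritten, so p is prime
            for j in range(p * p, limit + 1, p):
                if spf[j] == j:           # not yet claimed by a smaller prime
                    spf[j] = p
        p += 1
    return spf

def factorize(n, spf):
    """Prime factors of n (with multiplicity), read off the SPF table."""
    factors = []
    while n > 1:
        factors.append(spf[n])
        n //= spf[n]
    return factors

spf = build_spf(100)
print(factorize(84, spf))   # [2, 2, 3, 7]
```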

Q29

To design an efficient "next prime" function, the best strategy for large N (>10⁶) is:

Increment N and test each with O(√n) until a prime is found
Use the Prime Number Theorem to jump approximately ln(N) ahead, then search
Generate all primes up to 2N with a sieve
Use Fermat test only

Create

✅ Answer: (A) — By the Prime Number Theorem, the average gap between consecutive primes near N is approximately ln(N). For N = 10⁶, that's about 14. So O(√n) testing of roughly 14 candidates on average is fast enough.
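
A minimal sketch of strategy (A), reading "next prime" as the smallest prime strictly greater than n (that reading and the function names are assumptions):

```python
import math

def is_prime(n):
    """Trial division by odd numbers up to sqrt(n)."""
    if n < 2:
        return False
    if n < 4:
        return True
    if n % 2 == 0:
        return False
    for d in range(3, math.isqrt(n) + 1, 2):
        if n % d == 0:
            return False
    return True

def next_prime(n):
    """Smallest prime strictly greater than n."""
    candidate = n + 1
    while not is_prime(candidate):
        candidate += 1
    return candidate

print(next_prime(10**6))   # 1000003
```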

Q30

To implement a secure random prime generator for a simplified RSA system, you need:

Any random odd number
A random number generator + Miller-Rabin test with sufficient iterations
The largest known Mersenne prime
Sequential search starting from 2

Create

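✅ Answer: (B) — Generate a random number of the desired bit length, ensure it's odd, then run Miller-Rabin with ~40 iterations (probability of a false positive at most 4⁻⁴⁰ = 2⁻⁸⁰). Cryptographic libraries such as OpenSSL generate RSA primes along these lines, typically after a quick trial division by small primes.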
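
The recipe can be sketched in a few lines of Python. This is a teaching sketch under stated assumptions, not a production key generator: the function names, the 256-bit demo size, and the small-prime pre-check are choices made here for illustration. It uses the standard secrets module as the random source.

```python
import secrets

def is_probable_prime(n, rounds=40):
    """Miller-Rabin with random bases; a composite passes with probability <= 4**-rounds."""
    if n < 2:
        return False
    for p in (2, 3, 5, 7, 11, 13):        # cheap pre-check by small primes
        if n % p == 0:
            return n == p
    d, s = n - 1, 0                       # write n - 1 = d * 2**s with d odd
    while d % 2 == 0:
        d //= 2
        s += 1
    for _ in range(rounds):
        a = 2 + secrets.randbelow(n - 3)  # random base in [2, n - 2]
        x = pow(a, d, n)
        if x in (1, n - 1):
            continue
        for _ in range(s - 1):
            x = pow(x, 2, n)
            if x == n - 1:
                break
        else:
            return False                  # a is a witness: n is composite
    return True

def random_prime(bits):
    """Random probable prime with exactly `bits` bits (assumes bits >= 8)."""
    while True:
        candidate = secrets.randbits(bits)
        candidate |= (1 << (bits - 1)) | 1   # force the top bit and make it odd
        if is_probable_prime(candidate):
            return candidate

p = random_prime(256)
print(p.bit_length())   # 256
```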

Section G

Short Answer Questions (8)

Q1. Define primality testing and explain its importance in computer science.

Primality testing is the process of determining whether a given natural number is prime (greater than 1 and divisible only by 1 and itself). It is fundamental to computer science for several reasons: (1) Cryptography — RSA encryption, which helps secure online transactions including UPI payments, relies on large primes. (2) Hash tables — prime-sized tables reduce collisions. (3) Random number generation — many PRNGs use prime moduli. (4) Competitive programming — primality problems appear frequently on Codeforces, CodeChef, and LeetCode. (5) Error-correcting codes — Reed-Solomon codes use finite field arithmetic. Understanding efficient primality testing is essential for any serious programmer or computer scientist.

Q2. Why is checking divisibility up to √n sufficient? Prove it.

Proof: Suppose n is composite, meaning n = a × b where 1 < a, b < n. Assume for contradiction that both a > √n and b > √n. Then a × b > √n × √n = n, contradicting n = a × b. Therefore, at least one of a or b must be ≤ √n. Since any composite number n has at least one factor ≤ √n, we only need to check potential divisors from 2 to √n. If none divides n, then n must be prime. Impact: For n = 10⁹, this reduces checks from ~10⁹ to ~31,623 — a speedup of over 31,000×.
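
The bound translates directly into code. The sketch below (the function name and the returned division counter are illustrative additions) tests divisors only up to ⌊√n⌋ and reports how many divisions were needed.

```python
import math

def is_prime_sqrt(n):
    """Trial division by 2..floor(sqrt(n)); returns (is_prime, divisions_tried)."""
    if n < 2:
        return False, 0
    tried = 0
    for d in range(2, math.isqrt(n) + 1):
        tried += 1
        if n % d == 0:
            return False, tried       # found a factor <= sqrt(n)
    return True, tried                # no factor <= sqrt(n), so n is prime

print(is_prime_sqrt(1_000_000_007))   # (True, 31621)
print(is_prime_sqrt(91))              # (False, 6): 91 = 7 * 13, found at d = 7
```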

Q3. Explain Fermat's Little Theorem with an example.

Fermat's Little Theorem: If p is a prime number and a is any integer with gcd(a, p) = 1, then a^(p−1) ≡ 1 (mod p). Example: Let p = 7 and a = 2. Compute 2^(7−1) = 2⁶ = 64. Now 64 mod 7 = 64 − 63 = 1. Since the result is 1, the theorem holds. ✓ Another example: p = 11, a = 3. 3¹⁰ = 59049. 59049 mod 11 = 59049 − 5368×11 = 59049 − 59048 = 1. ✓ The contrapositive is used for testing: if a^(n−1) mod n ≠ 1, then n is definitely composite.

Q4. What are Carmichael numbers? Give 3 examples.

Carmichael numbers are composite numbers that satisfy the congruence of Fermat's Little Theorem for ALL bases coprime to them. Every coprime base is a "Fermat liar" for such a number, so they fool the Fermat primality test into thinking they are prime. Examples: (1) 561 = 3 × 11 × 17 — the smallest Carmichael number. (2) 1105 = 5 × 13 × 17. (3) 1729 = 7 × 13 × 19 — also the famous Hardy-Ramanujan number. They are problematic because the Fermat test can expose them only when a chosen base happens to share a factor with the number, which becomes very unlikely for large Carmichael numbers. To handle them, use the Miller-Rabin test, which can detect Carmichael numbers.
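
The defining property is easy to check by brute force for the three examples. In this sketch (the helper name fermat_passes is illustrative), every base coprime to n passes the Fermat condition, while no base sharing a factor with n does.

```python
from math import gcd

def fermat_passes(n, a):
    """True when base a satisfies a^(n-1) ≡ 1 (mod n)."""
    return pow(a, n - 1, n) == 1

for n in (561, 1105, 1729):
    coprime = [a for a in range(2, n) if gcd(a, n) == 1]
    shared = [a for a in range(2, n) if gcd(a, n) > 1]
    print(n,
          all(fermat_passes(n, a) for a in coprime),   # True: every coprime base passes
          any(fermat_passes(n, a) for a in shared))    # False: bases sharing a factor fail
```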

Q5. Describe the Sieve of Eratosthenes step-by-step for n = 30.

Step 1: Create a boolean array [0..30] with every entry set to true, then set arr[0] = arr[1] = false.

Step 2 (p = 2): Mark the multiples 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 as false.

Step 3 (p = 3): Starting from 3² = 9, mark 9, 15, 21, 27 as false (12, 18, 24, 30 are already marked).

Step 4 (p = 5): Starting from 5² = 25, mark 25 as false (30 is already marked). p = 4 is skipped because it is already marked composite.

Step 5: The next candidate is p = 6, and 6² = 36 > 30, so we stop.

Result: The numbers still marked true are 2, 3, 5, 7, 11, 13, 17, 19, 23, 29. These are all 10 primes ≤ 30.
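
The same walkthrough can be reproduced with a short tracing version of the sieve. The function name sieve_trace and the printed format are illustrative, and the code assumes n ≥ 1.

```python
def sieve_trace(n):
    """Sieve of Eratosthenes that prints what each prime newly marks (n >= 1)."""
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= n:
        if is_prime[p]:
            newly_marked = []
            for j in range(p * p, n + 1, p):   # start at p*p
                if is_prime[j]:
                    is_prime[j] = False
                    newly_marked.append(j)
            print(f"p={p}: newly marked {newly_marked}")
        p += 1
    return [i for i in range(n + 1) if is_prime[i]]

print(sieve_trace(30))
```

For n = 30 the trace shows p=3 newly marking [9, 15, 21, 27] and p=5 newly marking [25].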

Q6. Compare time complexity of naive primality test vs Sieve.

Naive test: O(n) per number or O(√n) per number with optimization. To check ALL numbers up to N: N × O(√N) = O(N√N). Sieve of Eratosthenes: O(N log log N) for ALL numbers up to N simultaneously. Comparison: For N = 10⁶: Naive = 10⁶ × 10³ = 10⁹ operations. Sieve = 10⁶ × 3 ≈ 3×10⁶ operations. The sieve is ~333× faster. When to use each: For a single query, √n test is better. For multiple queries or generating all primes up to N, the sieve is vastly superior.

Q7. What is a segmented sieve and when to use it?

A segmented sieve divides the range [L, R] into smaller segments of size ~√R and sieves each segment independently using pre-computed small primes (up to √R). Memory: O(√R) instead of O(R). When to use: When R > 10⁷ (regular sieve exceeds memory limits). For R = 10⁹, regular sieve needs ~1 GB; segmented sieve needs only ~31 KB for small primes plus segment buffer. Use cases: Finding primes in a range [L, R], prime counting in large ranges, problems on SPOJ like PRIME1. Time complexity remains O((R−L+1) log log R + √R).
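
The segment-by-segment idea can be sketched as below. The function name primes_in_range, the segment_size parameter and its default of ⌊√R⌋ are illustrative choices. Only the base primes and one segment buffer are held in memory at a time.

```python
import math

def primes_in_range(lo, hi, segment_size=None):
    """Yield the primes in [lo, hi], sieving one segment at a time."""
    if hi < 2:
        return
    lo = max(lo, 2)
    root = math.isqrt(hi)
    # Ordinary sieve for the base primes up to sqrt(hi).
    small = bytearray([1]) * (root + 1)
    base_primes = []
    for p in range(2, root + 1):
        if small[p]:
            base_primes.append(p)
            for j in range(p * p, root + 1, p):
                small[j] = 0
    size = segment_size or max(root, 1)
    for seg_lo in range(lo, hi + 1, size):
        seg_hi = min(seg_lo + size - 1, hi)
        seg = bytearray([1]) * (seg_hi - seg_lo + 1)   # seg[i] describes seg_lo + i
        for p in base_primes:
            if p * p > seg_hi:
                break
            # First multiple of p in the segment, never below p*p.
            start = max(p * p, (seg_lo + p - 1) // p * p)
            for j in range(start, seg_hi + 1, p):
                seg[j - seg_lo] = 0
        for i, flag in enumerate(seg):
            if flag:
                yield seg_lo + i

print(list(primes_in_range(100, 150)))
```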

Q8. Difference between deterministic and probabilistic primality tests.

Deterministic tests (trial division, AKS) give a definitive yes/no answer. Trial division is O(√n) and always correct. AKS runs in polynomial time and is always correct but has large constants. Probabilistic tests (Fermat, Miller-Rabin) are faster but can produce false positives. Fermat can be fooled by Carmichael numbers. Miller-Rabin with k iterations has error probability < 4^(−k) — with k=40, the probability of error is less than 2^(−80), which is negligible. For competitive coding: Deterministic O(√n) for n ≤ 10¹², Miller-Rabin for n > 10¹².
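
For 64-bit inputs the line between the two families blurs: Miller-Rabin run with the fixed bases 2, 3, 5, ..., 37 (the first 12 primes) is known to give the correct answer for every n below 2⁶⁴, so no randomness is needed. A minimal sketch, with the function name is_prime_64 chosen here for illustration:

```python
def is_prime_64(n):
    """Miller-Rabin with fixed bases; exact for every n < 2**64."""
    if n < 2:
        return False
    bases = (2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37)
    for p in bases:                 # trial division by the bases themselves
        if n % p == 0:
            return n == p
    d, s = n - 1, 0                 # write n - 1 = d * 2**s with d odd
    while d % 2 == 0:
        d //= 2
        s += 1
    for a in bases:
        x = pow(a, d, n)
        if x in (1, n - 1):
            continue
        for _ in range(s - 1):
            x = pow(x, 2, n)
            if x == n - 1:
                break
        else:
            return False            # base a proves n composite
    return True

print(is_prime_64(10**9 + 7))       # True
print(is_prime_64(3215031751))      # False: 151 * 751 * 28351 fools bases 2, 3, 5, 7
```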

Section H

Long Answer Questions (3)

Q1. Compare all primality testing methods covered in this unit.

Method	Time	Space	Type	Best Use
Naive O(n)	O(n)	O(1)	Deterministic	Educational only
O(√n)	O(√n)	O(1)	Deterministic	Single query, n ≤ 10¹²
Fermat	O(k log n) modular multiplications	O(1)	Probabilistic	Quick check (beware Carmichael)
Miller-Rabin	O(k log n) modular multiplications	O(1)	Probabilistic	Single query, n ≤ 10¹⁸
Sieve	O(n log log n)	O(n)	Deterministic	All primes up to n ≤ 10⁷
Segmented Sieve	O((R−L+1) log log R + √R)	O(√R)	Deterministic	Range queries, R up to 10⁹
Sieve of Atkin	O(n / log log n)	O(n)	Deterministic	Theoretical; very large n

The choice depends on the problem: single query vs multiple queries, range of n, memory constraints, and whether deterministic answers are required. For most competitive programming, O(√n) for single queries and Sieve of Eratosthenes for bulk generation cover 95% of problems.

Q2. Explain the Sieve of Eratosthenes in complete detail.

The Sieve of Eratosthenes, invented by the Greek mathematician Eratosthenes around 240 BC, is an algorithm to find all prime numbers up to a given limit n. It works by iteratively marking the multiples of each prime starting from 2.

Mathematical Basis: The Fundamental Theorem of Arithmetic states every integer > 1 has a unique prime factorization. The sieve exploits the fact that any composite number must have a prime factor ≤ √n.

Optimization — start from p²: When processing prime p, we start marking from p² (not 2p) because every smaller multiple kp with 1 < k < p has already been marked by a prime factor of k, which is smaller than p. This reduces the total work significantly.

Time Complexity Analysis: The inner loop for prime p does n/p operations. Total work = Σ(n/p) for all primes p ≤ √n = n × Σ(1/p). By Mertens' theorem, Σ(1/p) for primes p ≤ n ≈ log log n. Therefore, total time = O(n log log n). Since log log n grows extremely slowly (log log 10⁹ ≈ 3), this is practically linear.
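
Mertens' estimate can be checked numerically. The sketch below sums 1/p over the primes up to n and compares it with ln ln n + M, where M ≈ 0.2615 is the Meissel-Mertens constant (rounded here). The function name is illustrative and n ≥ 2 is assumed.

```python
import math

def prime_reciprocal_sum(n):
    """Sum of 1/p over all primes p <= n, using a simple sieve (n >= 2)."""
    is_prime = bytearray([1]) * (n + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(n) + 1):
        if is_prime[p]:
            is_prime[p * p :: p] = bytes(len(range(p * p, n + 1, p)))
    return sum(1 / p for p in range(2, n + 1) if is_prime[p])

MERTENS_CONSTANT = 0.2615   # Meissel-Mertens constant, rounded to 4 places

for n in (10**3, 10**4, 10**5, 10**6):
    actual = prime_reciprocal_sum(n)
    estimate = math.log(math.log(n)) + MERTENS_CONSTANT
    print(n, round(actual, 4), round(estimate, 4))
```

Even at n = 10⁶ the sum is below 3, which is why the sieve behaves almost linearly in practice.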

Optimizations: (1) Bitwise sieve: use bits instead of bytes, reducing memory by 8×. (2) Wheel factorization: skip multiples of 2, 3, 5 automatically. (3) Segment the sieve for cache efficiency.

Q3. How are prime numbers used in RSA encryption?

RSA Key Generation Process:

1. Choose two large primes p and q (each 300+ digits in practice).

2. Compute n = p × q. This is the modulus. n is made public; p and q are kept secret.

3. Compute Euler's totient: φ(n) = (p−1)(q−1). This is kept secret.

4. Choose public exponent e such that 1 < e < φ(n) and gcd(e, φ(n)) = 1. Commonly e = 65537.

5. Compute private exponent d = e⁻¹ mod φ(n) using the Extended Euclidean Algorithm.

6. Public key: (n, e). Private key: (n, d).

Encryption: c = m^e mod n. Decryption: m = c^d mod n.
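
The steps above can be traced with deliberately tiny numbers. The values p = 61, q = 53, e = 17 and the message 65 are a toy example chosen for illustration. Real keys use primes of 1024 bits or more, plus a padding scheme. The modular inverse uses three-argument pow with a negative exponent, which needs Python 3.8 or later.

```python
from math import gcd

p, q = 61, 53                  # step 1: two (toy) primes
n = p * q                      # step 2: modulus, 3233
phi = (p - 1) * (q - 1)        # step 3: Euler's totient, 3120
e = 17                         # step 4: public exponent
assert gcd(e, phi) == 1
d = pow(e, -1, phi)            # step 5: private exponent, 2753

message = 65
cipher = pow(message, e, n)    # encryption: c = m^e mod n, gives 2790
plain = pow(cipher, d, n)      # decryption: m = c^d mod n, gives 65 again

print(n, phi, d, cipher, plain)   # 3233 3120 2753 2790 65
```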

Why large primes are essential: The security relies on the computational difficulty of factoring n back into p and q. For RSA-2048, n has 617 digits. The best known factoring algorithms (General Number Field Sieve) would take billions of years with current computing power.

Indian Context: India's UPI (Unified Payments Interface), managed by NPCI, processes 10+ billion transactions monthly. Every transaction travels over TLS (Transport Layer Security), which relies on RSA or ECDSA certificates to authenticate the server. Aadhaar authentication also uses RSA-2048 for securing biometric data transmission. The security of ₹20+ lakh crore in monthly digital payments rests, in large part, on the difficulty of factoring the product of two large primes.

Section I

Lab Programs from Syllabus (5)

Lab 1: Finding Prime Factors by Square Root Method

⏱️ 30 min | Beginner

Objective:

Find all prime factors of a number using trial division up to √n.

Algorithm:

While n is divisible by 2, print 2 and divide n by 2.
For odd i from 3 to √n: while n is divisible by i, print i and divide.
If n > 1, print n (it's a remaining prime factor).

C++
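#include <iostream>
using namespace std;

// Print the prime factors of n (with multiplicity) by trial division up to sqrt(n).
void primeFactors(long long n) {
    if (n < 2) { cout << endl; return; }   // 0 and 1 have no prime factors
    // Strip every factor of 2 so the main loop can step over odd numbers only.
    while (n % 2 == 0) { cout << 2 << " "; n /= 2; }
    // i is a long long so that i * i cannot overflow for large n.
    for (long long i = 3; i * i <= n; i += 2) {
        while (n % i == 0) { cout << i << " "; n /= i; }
    }
    // Whatever is left (> 1) is a single prime factor larger than sqrt(n).
    if (n > 1) cout << n;
    cout << endl;
}

int main() { primeFactors(84); return 0; }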

2 2 3 7

Viva Voce:

V1: What is prime factorization? — Expressing a number as a product of prime powers.

V2: Why check up to √n? — If n has a factor > √n, it must have a corresponding factor < √n.

V3: Time complexity? — O(√n).

V4: Can this handle very large numbers? — Up to ~10⁹ efficiently; beyond that, use Pollard's rho.

V5: State the Fundamental Theorem of Arithmetic. — Every integer > 1 has a unique prime factorization.

Lab 2: Fermat Primality Test

⏱️ 30 min | Intermediate

Objective:

Implement probabilistic primality test using Fermat's Little Theorem.

Python
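import random

def power_mod(base, exp, mod):
    # Square-and-multiply: computes (base ** exp) % mod in O(log exp) steps.
    result = 1
    base %= mod
    while exp > 0:
        if exp % 2 == 1: result = (result * base) % mod
        exp //= 2
        base = (base * base) % mod
    return result

def fermat_test(n, k=10):
    if n < 2: return False
    if n < 4: return True           # 2 and 3 are prime
    for _ in range(k):
        a = random.randint(2, n-2)  # random base; it may share a factor with n
        # A prime n must give a^(n-1) ≡ 1 (mod n); any other result proves n composite.
        if power_mod(a, n-1, n) != 1: return False
    return True                     # "probably prime": no tested base exposed n

for n in [7, 15, 561, 997]:
    print(f"{n}: {'Prime' if fermat_test(n) else 'Composite'}")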

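Typical output (the bases are random, so the composite lines can vary between runs):

7: Prime
15: Composite
561: Composite
997: Prime

Note on 561: it is a Carmichael number, so it passes for every base coprime to it. It is usually still reported as composite here because about 43% of the bases in [2, 559] share a factor (3, 11 or 17) with 561 and expose it. The FALSE POSITIVE "561: Prime" appears only when all 10 random bases happen to be coprime to 561, which occurs in roughly 0.4% of runs. For large Carmichael numbers, bases sharing a factor are so rare that the Fermat test almost always reports "Prime".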

Viva Voce:

V1: State Fermat's Little Theorem. — If p is prime and gcd(a,p)=1, then a^(p-1) ≡ 1 (mod p).

V2: What are Carmichael numbers? — Composites that pass Fermat test for all coprime bases (e.g., 561).

V3: Why multiple iterations? — Each iteration with a random base reduces false positive probability.

V4: What is modular exponentiation? — Computing a^b mod m in O(log b) using repeated squaring.

V5: How does Miller-Rabin improve on Fermat? — It detects Carmichael numbers using strong pseudoprime tests.

Lab 3: Sieve of Eratosthenes

⏱️ 30 min | Beginner

Objective:

Generate all primes up to N using the Sieve of Eratosthenes.

C++
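#include <iostream>
#include <vector>
using namespace std;
int main() {
    int n = 50;
    // sieve[i] stays true while i is still a prime candidate.
    vector<bool> sieve(n+1, true);
    sieve[0] = sieve[1] = false;
    // Only primes p with p*p <= n are needed to cross out every composite <= n.
    for (int p=2; p*p<=n; p++)
        if (sieve[p])
            // Start at p*p: smaller multiples were marked by smaller primes.
            for (int j=p*p; j<=n; j+=p)
                sieve[j] = false;
    cout << "Primes up to " << n << ": ";
    for (int i=2; i<=n; i++)
        if (sieve[i]) cout << i << " ";
    cout << endl;
    return 0;
}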

Primes up to 50: 2 3 5 7 11 13 17 19 23 29 31 37 41 43 47

Viva Voce:

V1: Time complexity? — O(n log log n).

V2: Space complexity? — O(n) for the boolean array.

V3: Why start marking from p²? — Smaller multiples of p were already marked by smaller primes.

V4: Can we optimize memory? — Yes, bitwise sieve uses bits instead of bytes (8× savings).

V5: What is the prime counting function π(n)? — Number of primes ≤ n. π(10⁶) = 78498.

Lab 4: Segmented Sieve

⏱️ 45 min | Advanced

Objective:

Find all primes in a given range [L, R] using the segmented sieve approach.

Python
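import math

def segmented_sieve(L, R):
    # Assumes 0 <= L <= R.
    # Step 1: ordinary sieve for the base primes up to sqrt(R).
    limit = math.isqrt(R) + 1
    mark = [True] * (limit + 1)
    primes = []
    for p in range(2, limit + 1):
        if mark[p]:
            primes.append(p)
            for j in range(p*p, limit+1, p): mark[j] = False
    # Step 2: is_prime[i] describes the number L + i.
    is_prime = [True] * (R - L + 1)
    for p in primes:
        # First multiple of p inside [L, R], never below p*p (so p itself survives).
        start = max(p*p, ((L+p-1)//p)*p)
        for j in range(start, R+1, p): is_prime[j-L] = False
    # 0 and 1 are not prime; clear them if they fall inside the range.
    for v in (0, 1):
        if L <= v <= R: is_prime[v-L] = False
    return [L+i for i in range(R-L+1) if is_prime[i]]

print(segmented_sieve(100, 150))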

[101, 103, 107, 109, 113, 127, 131, 137, 139, 149]

Viva Voce:

V1: Why not use a regular sieve for large ranges? — Memory constraint: sieve of 10⁹ needs ~1 GB.

V2: Memory advantage? — O(√R) for the base primes plus one segment buffer, vs O(R) for a regular sieve.

V3: How to choose segment size? — Typically √R or cache size for optimal performance.

V4: Time complexity? — O((R−L+1) log log R + √R).

V5: Applications? — SPOJ PRIME1, Project Euler problems, counting primes in ranges.

Lab 5: Sieve of Atkin

⏱️ 45 min | Advanced

Objective:

Implement the Sieve of Atkin using quadratic forms.

Python
import math

def sieve_of_atkin(limit):
    # Assumes limit >= 0.
    sieve = [False] * (limit + 1)
    root = math.isqrt(limit)
    # Toggle each candidate once per solution of its quadratic form.
    for x in range(1, root + 1):
        for y in range(1, root + 1):
            n = 4*x*x + y*y
            if n <= limit and (n%12 == 1 or n%12 == 5):
                sieve[n] = not sieve[n]
            n = 3*x*x + y*y
            if n <= limit and n%12 == 7:
                sieve[n] = not sieve[n]
            n = 3*x*x - y*y
            if x > y and n <= limit and n%12 == 11:
                sieve[n] = not sieve[n]
    # Remaining candidates divisible by the square of a prime are not prime.
    for r in range(5, root + 1):
        if sieve[r]:
            for i in range(r*r, limit+1, r*r):
                sieve[i] = False
    # 2 and 3 are added by hand, but only if they fit under the limit.
    return [p for p in (2, 3) if p <= limit] + [i for i in range(5, limit+1) if sieve[i]]

print(sieve_of_atkin(100))

[2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97]

Viva Voce:

V1: How does Atkin's sieve differ from Eratosthenes? — It uses quadratic forms instead of marking multiples.

V2: What are the three quadratic forms? — 4x²+y², 3x²+y², 3x²−y² with specific conditions modulo 12.

V3: Time complexity? — O(n / log log n), slightly better than Eratosthenes asymptotically.

V4: When is Atkin's sieve faster? — Theoretically for very large n, but optimized Eratosthenes often wins in practice.

V5: Who invented it? — A.O.L. Atkin and Daniel J. Bernstein, published in 2004.

Section J

Industry Spotlight — A Day in the Life

👩‍💻 Priya Nair, 26 — Cryptography Engineer at Razorpay, Bangalore

Background: B.Tech in Computer Science from NIT Calicut. Competitive programmer — Codeforces rating 1800+ (Expert), CodeChef 5-star. Interned at Razorpay in 3rd year working on payment gateway security. Converted to full-time offer. Now works on cryptographic modules for payment processing.

A Typical Day:

9:00 AM — Morning standup with the security engineering team. Review overnight security alerts and vulnerability reports.

10:00 AM — Code review of TLS implementation updates. Check certificate validation logic and cipher suite configurations.

11:30 AM — Implement and test prime generation module for the internal key management system. Benchmark with various key sizes (1024, 2048, 4096 bits).

1:00 PM — Lunch at Razorpay's Bangalore office. Participate in a Codeforces virtual contest during break.

2:00 PM — Research post-quantum cryptography algorithms. Write internal documentation on lattice-based alternatives to RSA.

4:00 PM — Pair programming with a junior developer on implementing RSA-OAEP padding scheme. Review their understanding of prime generation.

5:30 PM — Security audit of a new payment API endpoint. Run automated vulnerability scans using OpenSSL toolkit.

Detail	Info
Tools Used Daily	OpenSSL, C++, Python, Go, HSMs (Hardware Security Modules), Wireshark
Entry Salary (2024)	₹8–12 LPA + ESOPs
Mid-Level (3–5 yrs)	₹15–25 LPA
Senior (7+ yrs)	₹30–50 LPA
Companies Hiring	Razorpay, PhonePe, Visa, Mastercard, NPCI, Juspay, Pine Labs, BharatPe, PayU, Cashfree

Section K

Earn With It — Freelance & Income Roadmap

💰 Your Earning Path After This Chapter

Portfolio Pieces: Implement and showcase primality testing library on GitHub. Solve 50+ competitive programming problems involving primes.

Earning Avenues:

• Competitive programming prizes (Codeforces, CodeChef, Google Code Jam) — ₹5,000–₹50,000 per contest

• Bug bounty programs (HackerOne India: Razorpay, Paytm, CRED) — ₹5,000–₹5,00,000 per bug

• Freelance security auditing and crypto implementation — ₹10,000–₹50,000/project

• Technical writing about algorithms and cryptography — ₹2,000–₹8,000/article

• Open-source contributions to crypto libraries — builds reputation for high-paying jobs

Platform	Best For	Typical Rate
Codeforces/CodeChef	Contest prizes, building competitive programming profile	₹5,000–₹1,00,000/contest
HackerOne India	Bug bounty programs (Razorpay, Paytm, etc.)	₹5,000–₹5,00,000/vulnerability
Toptal/Upwork	Freelance security consulting, crypto implementations	$30–$100/hour
GitHub Sponsors	Open-source crypto library contributions	$50–$500/month (sponsorship)
Medium/Dev.to	Technical articles on algorithms and cryptography	₹2,000–₹8,000/article

Fastest path to earning: Solve 100 competitive programming problems on Codeforces (focus on number theory tag). Reach 1400+ rating. Then participate in rated contests for prize money and build a profile that attracts job offers from top companies.

Section L

Chapter Summary

🔑 Key Takeaways — Primality Testing

A prime number has exactly two distinct factors: 1 and itself. 2 is the only even prime.
Naive O(n) test: check all divisors from 2 to n−1. Too slow for n > 10⁶.
O(√n) optimization: if n is composite, it has a factor ≤ √n. Check only up to √n.
6k±1 trick: all primes > 3 are of the form 6k±1, letting us skip 2/3 of candidates.
Trial division factorization finds all prime factors in O(√n) by dividing by smallest factors repeatedly.
Fermat's Little Theorem: a^(p−1) ≡ 1 (mod p) for prime p. Basis for probabilistic testing.
Carmichael numbers (561, 1105, 1729) fool the Fermat test — they're composite but pass all coprime bases.
Sieve of Eratosthenes ⭐: generates ALL primes up to n in O(n log log n). The most important algorithm in this unit.
Sieve optimization: start marking from p², not 2p, since smaller multiples were already marked.
Segmented Sieve: handles ranges where regular sieve exceeds memory. Uses O(√R) memory.
Sieve of Atkin: uses quadratic forms. O(n/log log n) time. Mainly theoretical interest.
Prime factorization enables counting divisors: if N = p₁^a₁ × p₂^a₂ × ..., divisors = Π(aᵢ+1).
RSA encryption relies on the difficulty of factoring the product of two large primes.
India's UPI/NPCI system uses RSA-2048 to secure 10+ billion monthly transactions.
For single queries: use O(√n). For bulk generation: use Sieve. For very large n: use Miller-Rabin.
Every competitive programmer must know Sieve of Eratosthenes — it appears in 20%+ of number theory problems.
Next prime palindrome: combine palindrome check with primality test. The average prime gap near n is about ln(n).
Primes are infinite (Euclid's proof, ~300 BC) and their distribution follows the Prime Number Theorem: π(n) ≈ n/ln(n).

Section M

Earning Checkpoint

Skill	Tool / Method	Portfolio Deliverable	Ready to Earn?
O(√n) Primality Test	C++ / Python	Competitive programming solutions on Codeforces	✅ Yes — foundation for contest problems
Prime Factorization	Trial Division	Number theory problem solutions	✅ Yes — common in interviews
Fermat Primality Test	Modular Exponentiation	Crypto-related implementations	✅ Yes — useful in security projects
Sieve of Eratosthenes	Boolean Array + Iteration	Efficient prime generator library	✅ Yes — essential for competitive coding
Segmented Sieve	Range-based Sieving	Advanced contest solutions (SPOJ PRIME1)	✅ Yes — demonstrates advanced skills
Cryptography Concepts	RSA, Modular Arithmetic	Security audit reports, crypto implementations	⬜ Partially — need more depth for professional work

Minimum Viable Earning Setup after this chapter: A Codeforces profile with 50+ solved number theory problems + a GitHub repository showcasing your primality testing library + an active competitive programming practice routine = you can earn ₹5,000–₹25,000/month from contest prizes, bug bounties, and freelance algorithm work.

✅ Unit 2 complete. Ready for Unit 3!

[QR: Link to EduArtha video tutorial — Primality Testing Deep Dive]