P6 Variety Checkpoint — by Eric San

Current Bank

274

P6 questions¹

Families Reviewed

new candidate families¹

Safe To Build

`adopt` families¹

Advanced Gap

current advanced coverage¹

I. What This Checkpoint Is

This is the checkpoint after the first P6 difficulty-expansion pass. Instead of jumping straight into generator code, we assembled a reviewable set of question families, tagged their provenance, and filtered them through the local atomic graph plus the HK curriculum.¹

The goal is not to import “harder foreign questions” blindly. The goal is to improve variety in question templates and variety in difficulty while staying syllabus-safe and debuggable later.²

Core rule If a SG or UK family overlaps with neither the atomic graph nor the HK curriculum, it stays out of the core P6 bank. That lets us use outside systems as pattern donors without letting them quietly rewrite the syllabus.

II. Baseline: What The Current Bank Already Does Well

The current P6 bank covers the TSA floor well: forward percentages, direct averages, direct speed formula, ratio division, formula-based area/volume, and standard data handling.¹

Basic

204

Intermediate

Advanced

The actual problem is not “coverage.” It is ceiling. The bank covers many topics, but most of its prompts are still single-topic, forward-reasoning, and formulaic. The ceiling flattens before HKAT-level reasoning starts.¹

III. Source Provenance: What Each System Contributes

Different source systems contribute different kinds of value. HKAT is the main local spine. SG and UK contribute reasoning styles. HKMO/HKIMO show the ceiling but should not leak into the core bank.³⁴⁵⁶

Source	What It Gives Us	Status In This Checkpoint
HKAT	Cross-domain questions, explicit equation use, harder-but-local benchmark	PRIMARY SPINE
HK school tiered objectives	Evidence for what “high” looks like inside HK primary practice	LOCAL VALIDATION
UK SATs	Backward / missing-number reasoning and answer-checking habits	PATTERN DONOR
PSLE	Optimization and comparison styles	STRETCH ONLY
HKMO / HKIMO	Competition ceiling and enrichment lane	HOLD OUT
Internal synthesis	Combining already-owned skills into new families	CASE BY CASE

IV. Variety In Question Templates

Template variety is not just “more topics.” It is more ways for a student to be asked to think. This checkpoint expands the bank along five meaningful directions.¹²

HKAT ADOPT

Backward Reasoning

Move from “apply a formula” to “recover the hidden original.”

Examples: original price from sale price, required score from target average.

HKAT ADOPT

Cross-Topic Integration

Move from single-topic prompts to data-plus-percentage or discount-plus-equation questions.

This is the cleanest way to make the bank feel more exam-like without leaving syllabus.

HK LOCAL ADAPT

Changing-Base Percentages

Move from one-shot percentage change to sequential change on changing bases.

Example: 27% removed, then 40% of the remainder.

UK / TSA ADAPT

Estimation And Reasonableness

Move from exact calculation only to “does this answer even make sense?”

This improves number sense, not just procedure.

INTERNAL SYNTHESIS ADAPT

Skill Fusion

Use existing owned skills in new combinations, such as speed plus ratio comparison.

Useful, but teacher validation should come first.

PSLE / HKMO OUT

Optimization And Competition Patterns

These are strong for enrichment, but they currently fail the overlap test for the core bank.

Good inspiration, bad default import.

V. Variety In Difficulty

The checkpoint is not only adding new prompt shapes. It is deliberately adding the missing upper band. Current production P6 has no advanced questions after calibration.¹

Difficulty lever	Old bank tendency	New checkpoint shift
Reasoning direction	Forward only	Backward and reverse reasoning
Topic count	One topic per item	Two or three topics interleaved
Information use	All information directly useful	Interpret, filter, or compare information
Decision demand	Compute a single answer	Judge, compare, justify, or choose best fit
Mathematical communication	Short numeric answer	Equation setup or explanation-like structure

What We Want

A visible advanced band inside P6
Questions that feel meaningfully different, not just numerically harder
Difficulty uplift that still looks local to HK teachers

What We Avoid

Random hard questions imported from other systems
Competition-style drift into out-of-syllabus content
“Harder” meaning only bigger numbers or more arithmetic

VI. Added Question Families At This Checkpoint

The table below is the practical build list. It is where provenance, template variety, and difficulty variety meet.

Family	Sample IDs	Source	Decision	What It Improves
Backward percentage / recover original	BR-1, BR-3	HKAT + HK school tiered objectives	ADOPT	Backward reasoning + equation-friendly difficulty
Backward average / required score	BR-2	HKAT + UK SATs reasoning style	ADOPT	Reverse mean reasoning + threshold thinking
Sequential percentages	SP-1, SP-2	HK school tiered objectives	ADOPT	Changing-base percentage reasoning
Data reading + percentage	CT-1	HKAT	ADOPT	Cross-domain question shape
Discount + equation	CT-2	HKAT	ADOPT	Equation setup + percentage integration
Speed + ratio comparison	CT-3	Internal synthesis / PSLE-style comparison	ADAPT	New fused template from owned skills
Deal comparison / least-cost package	CO-1, CO-2	PSLE	OUT	Comparison / optimization inspiration only
Estimation / reasonableness	EST-1, EST-2	TSA + UK SATs	ADAPT	Number sense and answer-checking
Pattern / sequence families	held out	UK SATs + competition	OUT	Enrichment only
Competition families	held out	HKMO / HKIMO	OUT	Ceiling reference, not core syllabus

VII. Sample Highlights

The point of the sample set is to show the flavor of the uplift, not to pretend all 12 belong in production as-is.¹

HKAT: Discount + Equation

“A stationery shop has a 25% off sale. A teacher buys 15 identical notebooks and pays $157.50 in total. Using an equation, find the original price of one notebook.”

This improves both template variety and difficulty variety at once: the student must form an equation, not just apply a discount formula.²

HKAT + UK SATs style: Backward Average

“David scored 78, 85, 92, and 71 on his first four tests. What minimum score does he need on the fifth test to achieve an average of at least 82?”

This turns a familiar average topic into a reverse-reasoning problem, which is exactly the kind of ceiling lift the current bank lacks.⁴

HK Local High Tier: Sequential Percentages

“A warehouse had 3000 boxes. 27% were moved yesterday. Today, 40% of the remainder were moved. How many remain?”

The key difficulty is not computation. It is resisting the instinct to add 27% and 40% directly. This is a very local, syllabus-safe way to raise difficulty.⁷

Internal Synthesis: Speed + Ratio

“Mei Ling cycles to school at 12 km/h and walks home at 4 km/h. The school is 6 km away. What is the ratio of her cycling time to her walking time?”

This shows how two owned skills can be fused into a more interesting prompt. It should stay stretch-only until teacher-validated, but it is a promising way to expand without importing foreign content wholesale.

PSLE: Deal Comparison (Held Out)

Least-cost ticket/package comparison problems.

These are excellent at producing genuine comparison reasoning, but right now they fail the overlap test. Good inspiration, wrong default import.³

VIII. What We Should Build Next

Build now: backward percentage, backward average, data-plus-percentage cross-domain, and discount-plus-equation.

Build as stretch: sequential percentages, speed-plus-ratio, estimation / reasonableness.

Hold out: PSLE-style optimization, pattern families, and competition-only content.

Why this order works: it gives us visible variety gains while keeping the core bank locally defensible.

Checkpoint output This checkpoint is already enough to explain the strategy to Leslie and Renee: we are improving question variety by adding new reasoning shapes, and improving difficulty variety by adding a controlled advanced band, not by random hardening.

Verdict

The right next move is selective uplift, not broad import. HKAT should be the main backbone for P6 difficulty expansion. UK SATs and PSLE are useful only when they donate a reasoning pattern that still survives local overlap checks. HKMO/HKIMO should remain ceiling references rather than core bank inputs.

Template variety improves when we move from forward single-topic prompts into backward, cross-domain, and judgment-heavy prompts. Difficulty variety improves when those new prompts introduce reasoning reversal, topic interleaving, filtering, and equation setup in a controlled way.

Practical build order: generatorize the four `adopt` families now, keep the three `adapt` families as stretch with teacher validation, and keep the `out_of_syllabus` families out of the main P6 bank until school materials confirm them.

References

[1] P6 Diversity Review — existing review page for the current bank, candidate families, and sample questions. Internal baseline counts, family decisions, and sample set

[2] Pan Lloyds HKAT Demo Paper — Pre-S1 HKAT practice material. Cross-domain items and discount-plus-equation question shape

[3] PSLE 2024 Analysis — Singapore pattern donor for optimization and comparison questions. Useful inspiration, but not currently core-bank safe

[4] UK KS2 SATs 2024 Analysis — reasoning-paper patterns. Backward reasoning and missing-number style

[5] HKMO 2023 P5 Paper — competition ceiling reference. Not core-bank input; used to define the ceiling

[6] HKIMO Sample Papers — competition ceiling reference. Enrichment lane only

[7] HK School Tiered Objectives — local low / medium / high examples. Confirms sequential percentage and higher-tier local difficulty patterns

[8] HKEAA TSA Papers — P6 baseline. Defines the current floor rather than the desired ceiling