P6 Variety Checkpoint

Where the new question families came from, what they add, and how they widen both template variety and difficulty variety
24 March 2026
Current Bank
274
P6 questions1
Families Reviewed
10
new candidate families1
Safe To Build
4
`adopt` families1
Advanced Gap
0%
current advanced coverage1

I. What This Checkpoint Is

This is the checkpoint after the first P6 difficulty-expansion pass. Instead of jumping straight into generator code, we assembled a reviewable set of question families, tagged their provenance, and filtered them through the local atomic graph plus the HK curriculum.1

The goal is not to import “harder foreign questions” blindly. The goal is to improve variety in question templates and variety in difficulty while staying syllabus-safe and debuggable later.2

Core rule If a SG or UK family overlaps with neither the atomic graph nor the HK curriculum, it stays out of the core P6 bank. That lets us use outside systems as pattern donors without letting them quietly rewrite the syllabus.

II. Baseline: What The Current Bank Already Does Well

The current P6 bank covers the TSA floor well: forward percentages, direct averages, direct speed formula, ratio division, formula-based area/volume, and standard data handling.1

Basic
204
Intermediate
70
Advanced
0
The actual problem is not “coverage.” It is ceiling. The bank covers many topics, but most of its prompts are still single-topic, forward-reasoning, and formulaic. The ceiling flattens before HKAT-level reasoning starts.1

III. Source Provenance: What Each System Contributes

Different source systems contribute different kinds of value. HKAT is the main local spine. SG and UK contribute reasoning styles. HKMO/HKIMO show the ceiling but should not leak into the core bank.3456

SourceWhat It Gives UsStatus In This Checkpoint
HKAT Cross-domain questions, explicit equation use, harder-but-local benchmark PRIMARY SPINE
HK school tiered objectives Evidence for what “high” looks like inside HK primary practice LOCAL VALIDATION
UK SATs Backward / missing-number reasoning and answer-checking habits PATTERN DONOR
PSLE Optimization and comparison styles STRETCH ONLY
HKMO / HKIMO Competition ceiling and enrichment lane HOLD OUT
Internal synthesis Combining already-owned skills into new families CASE BY CASE

IV. Variety In Question Templates

Template variety is not just “more topics.” It is more ways for a student to be asked to think. This checkpoint expands the bank along five meaningful directions.12

HKAT ADOPT

Backward Reasoning

Move from “apply a formula” to “recover the hidden original.”

Examples: original price from sale price, required score from target average.

HKAT ADOPT

Cross-Topic Integration

Move from single-topic prompts to data-plus-percentage or discount-plus-equation questions.

This is the cleanest way to make the bank feel more exam-like without leaving syllabus.

HK LOCAL ADAPT

Changing-Base Percentages

Move from one-shot percentage change to sequential change on changing bases.

Example: 27% removed, then 40% of the remainder.

UK / TSA ADAPT

Estimation And Reasonableness

Move from exact calculation only to “does this answer even make sense?”

This improves number sense, not just procedure.

INTERNAL SYNTHESIS ADAPT

Skill Fusion

Use existing owned skills in new combinations, such as speed plus ratio comparison.

Useful, but teacher validation should come first.

PSLE / HKMO OUT

Optimization And Competition Patterns

These are strong for enrichment, but they currently fail the overlap test for the core bank.

Good inspiration, bad default import.


V. Variety In Difficulty

The checkpoint is not only adding new prompt shapes. It is deliberately adding the missing upper band. Current production P6 has no advanced questions after calibration.1

Difficulty leverOld bank tendencyNew checkpoint shift
Reasoning direction Forward only Backward and reverse reasoning
Topic count One topic per item Two or three topics interleaved
Information use All information directly useful Interpret, filter, or compare information
Decision demand Compute a single answer Judge, compare, justify, or choose best fit
Mathematical communication Short numeric answer Equation setup or explanation-like structure

What We Want

  • A visible advanced band inside P6
  • Questions that feel meaningfully different, not just numerically harder
  • Difficulty uplift that still looks local to HK teachers

What We Avoid

  • Random hard questions imported from other systems
  • Competition-style drift into out-of-syllabus content
  • “Harder” meaning only bigger numbers or more arithmetic

VI. Added Question Families At This Checkpoint

The table below is the practical build list. It is where provenance, template variety, and difficulty variety meet.

FamilySample IDsSourceDecisionWhat It Improves
Backward percentage / recover original BR-1, BR-3 HKAT + HK school tiered objectives ADOPT Backward reasoning + equation-friendly difficulty
Backward average / required score BR-2 HKAT + UK SATs reasoning style ADOPT Reverse mean reasoning + threshold thinking
Sequential percentages SP-1, SP-2 HK school tiered objectives ADOPT Changing-base percentage reasoning
Data reading + percentage CT-1 HKAT ADOPT Cross-domain question shape
Discount + equation CT-2 HKAT ADOPT Equation setup + percentage integration
Speed + ratio comparison CT-3 Internal synthesis / PSLE-style comparison ADAPT New fused template from owned skills
Deal comparison / least-cost package CO-1, CO-2 PSLE OUT Comparison / optimization inspiration only
Estimation / reasonableness EST-1, EST-2 TSA + UK SATs ADAPT Number sense and answer-checking
Pattern / sequence families held out UK SATs + competition OUT Enrichment only
Competition families held out HKMO / HKIMO OUT Ceiling reference, not core syllabus

VII. Sample Highlights

The point of the sample set is to show the flavor of the uplift, not to pretend all 12 belong in production as-is.1

HKAT: Discount + Equation

“A stationery shop has a 25% off sale. A teacher buys 15 identical notebooks and pays $157.50 in total. Using an equation, find the original price of one notebook.”
This improves both template variety and difficulty variety at once: the student must form an equation, not just apply a discount formula.2

HKAT + UK SATs style: Backward Average

“David scored 78, 85, 92, and 71 on his first four tests. What minimum score does he need on the fifth test to achieve an average of at least 82?”
This turns a familiar average topic into a reverse-reasoning problem, which is exactly the kind of ceiling lift the current bank lacks.4

HK Local High Tier: Sequential Percentages

“A warehouse had 3000 boxes. 27% were moved yesterday. Today, 40% of the remainder were moved. How many remain?”
The key difficulty is not computation. It is resisting the instinct to add 27% and 40% directly. This is a very local, syllabus-safe way to raise difficulty.7

Internal Synthesis: Speed + Ratio

“Mei Ling cycles to school at 12 km/h and walks home at 4 km/h. The school is 6 km away. What is the ratio of her cycling time to her walking time?”
This shows how two owned skills can be fused into a more interesting prompt. It should stay stretch-only until teacher-validated, but it is a promising way to expand without importing foreign content wholesale.

PSLE: Deal Comparison (Held Out)

Least-cost ticket/package comparison problems.
These are excellent at producing genuine comparison reasoning, but right now they fail the overlap test. Good inspiration, wrong default import.3

VIII. What We Should Build Next

Build now: backward percentage, backward average, data-plus-percentage cross-domain, and discount-plus-equation.

Build as stretch: sequential percentages, speed-plus-ratio, estimation / reasonableness.

Hold out: PSLE-style optimization, pattern families, and competition-only content.

Why this order works: it gives us visible variety gains while keeping the core bank locally defensible.

Checkpoint output This checkpoint is already enough to explain the strategy to Leslie and Renee: we are improving question variety by adding new reasoning shapes, and improving difficulty variety by adding a controlled advanced band, not by random hardening.

Verdict

The right next move is selective uplift, not broad import. HKAT should be the main backbone for P6 difficulty expansion. UK SATs and PSLE are useful only when they donate a reasoning pattern that still survives local overlap checks. HKMO/HKIMO should remain ceiling references rather than core bank inputs.

Template variety improves when we move from forward single-topic prompts into backward, cross-domain, and judgment-heavy prompts. Difficulty variety improves when those new prompts introduce reasoning reversal, topic interleaving, filtering, and equation setup in a controlled way.

Practical build order: generatorize the four `adopt` families now, keep the three `adapt` families as stretch with teacher validation, and keep the `out_of_syllabus` families out of the main P6 bank until school materials confirm them.


References

[1] P6 Diversity Review — existing review page for the current bank, candidate families, and sample questions. Internal baseline counts, family decisions, and sample set
[2] Pan Lloyds HKAT Demo Paper — Pre-S1 HKAT practice material. Cross-domain items and discount-plus-equation question shape
[3] PSLE 2024 Analysis — Singapore pattern donor for optimization and comparison questions. Useful inspiration, but not currently core-bank safe
[4] UK KS2 SATs 2024 Analysis — reasoning-paper patterns. Backward reasoning and missing-number style
[5] HKMO 2023 P5 Paper — competition ceiling reference. Not core-bank input; used to define the ceiling
[6] HKIMO Sample Papers — competition ceiling reference. Enrichment lane only
[7] HK School Tiered Objectives — local low / medium / high examples. Confirms sequential percentage and higher-tier local difficulty patterns
[8] HKEAA TSA Papers — P6 baseline. Defines the current floor rather than the desired ceiling