| name | policyengine-aggregation |
| description | PolicyEngine aggregation patterns - using adds attribute and add() function for summing variables across entities |
PolicyEngine Aggregation Patterns
Essential patterns for summing variables across entities in PolicyEngine.
Quick Decision Guide
Is the variable ONLY a sum of other variables?
│
├─ YES → Use `adds` attribute (NO formula needed!)
│ adds = ["var1", "var2"]
│
└─ NO → Use `add()` function in formula
(when you need max_, where, conditions, etc.)
Quick Reference
| Need | Use | Example |
|---|
| Simple sum | adds | adds = ["var1", "var2"] |
| Sum from parameters | adds | adds = "gov.path.to.list" |
| Sum + max_() | add() | max_(0, add(...)) |
| Sum + where() | add() | where(cond, add(...), 0) |
| Sum + conditions | add() | if cond: add(...) |
| Count booleans | adds | adds = ["is_eligible"] |
1. adds Class Attribute (Preferred When Possible)
When to Use
Use adds when a variable is ONLY the sum of other variables with NO additional logic.
Syntax
class variable_name(Variable):
value_type = float
entity = Entity
definition_period = PERIOD
adds = ["variable1", "variable2", "variable3"]
adds = "gov.path.to.parameter.list"
Key Points
- ✅ No
formula() method needed
- ✅ Automatically handles entity aggregation (person → household/tax_unit/spm_unit)
- ✅ Clean and declarative
Example: Simple Income Sum
class tanf_gross_earned_income(Variable):
value_type = float
entity = SPMUnit
label = "TANF gross earned income"
unit = USD
definition_period = MONTH
adds = ["employment_income", "self_employment_income"]
Example: Using Parameter List
class income_tax_refundable_credits(Variable):
value_type = float
entity = TaxUnit
definition_period = YEAR
adds = "gov.irs.credits.refundable"
Example: Counting Boolean Values
class count_eligible_people(Variable):
value_type = int
entity = SPMUnit
definition_period = YEAR
adds = ["is_eligible_person"]
2. add() Function (When Logic Needed)
When to Use
Use add() inside a formula() when you need:
- To apply
max_(), where(), or conditions
- To combine with other operations
- To modify values before/after summing
Syntax
from policyengine_us.model_api import *
def formula(entity, period, parameters):
result = add(entity, period, variable_list)
Parameters:
entity: The entity to operate on
period: The time period for calculation
variable_list: List of variable names or parameter path
Example: With max_() to Prevent Negatives
class adjusted_earned_income(Variable):
value_type = float
entity = SPMUnit
definition_period = MONTH
def formula(spm_unit, period, parameters):
gross = add(spm_unit, period, ["employment_income", "self_employment_income"])
return max_(0, gross)
Example: With Additional Logic
class household_benefits(Variable):
value_type = float
entity = Household
definition_period = YEAR
def formula(household, period, parameters):
BENEFITS = ["snap", "tanf", "ssi", "social_security"]
existing = add(household, period, BENEFITS)
new_benefit = household("special_benefit", period)
p = parameters(period).gov.special_benefit
if p.include_in_total:
return existing + new_benefit
return existing
Example: Building on Previous Variables
class total_deductions(Variable):
value_type = float
entity = TaxUnit
definition_period = YEAR
def formula(tax_unit, period, parameters):
p = parameters(period).gov.irs.deductions
standard = add(tax_unit, period, p.standard_items)
income = tax_unit("adjusted_gross_income", period)
phase_out_rate = p.phase_out_rate
phase_out_start = p.phase_out_start
reduction = max_(0, (income - phase_out_start) * phase_out_rate)
return max_(0, standard - reduction)
3. Common Anti-Patterns to Avoid
❌ NEVER: Manual Summing
def formula(spm_unit, period, parameters):
person = spm_unit.members
employment = person("employment_income", period)
self_emp = person("self_employment_income", period)
return spm_unit.sum(employment + self_emp)
✅ CORRECT: Use adds
adds = ["employment_income", "self_employment_income"]
❌ WRONG: Using add() When adds Suffices
def formula(spm_unit, period, parameters):
return add(spm_unit, period, ["income1", "income2"])
✅ CORRECT: Use adds
adds = ["income1", "income2"]
4. Entity Aggregation Explained
When using adds or add(), PolicyEngine automatically handles entity aggregation:
class household_total_income(Variable):
entity = Household
definition_period = YEAR
adds = ["employment_income", "self_employment_income"]
This works across all entity hierarchies:
- Person → Tax Unit
- Person → SPM Unit
- Person → Household
- Tax Unit → Household
- SPM Unit → Household
5. Parameter Lists
Parameters can define lists of variables to sum:
Parameter file (gov/irs/credits/refundable.yaml):
description: List of refundable tax credits
values:
2024-01-01:
- earned_income_tax_credit
- child_tax_credit
- additional_child_tax_credit
Usage in variable:
adds = "gov.irs.credits.refundable"
6. Decision Matrix
| Scenario | Solution | Code |
|---|
| Sum 2-3 variables | adds attribute | adds = ["var1", "var2"] |
| Sum many variables | Parameter list | adds = "gov.path.list" |
| Sum + prevent negatives | add() with max_() | max_(0, add(...)) |
| Sum + conditional | add() with where() | where(eligible, add(...), 0) |
| Sum + phase-out | add() with calculation | add(...) - reduction |
| Count people/entities | adds with boolean | adds = ["is_child"] |