| name | ai-data-foundation-audit |
| description | Audit and prepare client data for AI marketing readiness. Produces a completed data hygiene checklist, a data mapping framework, and a 30-day remediation plan so that the client's data is clean and structured before any AI marketing tool is deployed. Invoke before any AI marketing tool deployment — this skill establishes the data quality foundation that all AI marketing activities depend on. Also invoke when a client reports inconsistent AI outputs, poor personalisation results, or chatbot errors.
|
AI Data Foundation Audit
Use when
- Audit and prepare client data for AI marketing readiness. Produces a completed data hygiene checklist, a data mapping framework, and a 30-day remediation plan so that the client's data is clean and structured before any AI marketing tool is deployed. Invoke before any AI marketing tool deployment — this skill establishes the data quality foundation that all AI marketing activities depend on. Also invoke when a client reports inconsistent AI outputs, poor personalisation results, or chatbot errors.
- Use this skill when it is the closest match to the requested deliverable or workflow.
Do not use when
- Do not use this skill for graphic design, video production, software development, or legal advice beyond the repository's stated scope.
- Do not use it when another skill in this repository is clearly more specific to the requested deliverable.
Workflow
- Collect the required inputs or source material before drafting, unless this skill explicitly generates the intake itself.
- Follow the section order and decision rules in this
SKILL.md; do not skip mandatory steps or required fields.
- Review the draft against the quality criteria, then deliver the final output in markdown unless the skill specifies another format.
Anti-Patterns
- Do not invent client facts, performance data, budgets, or approvals that were not provided or clearly inferred from evidence.
- Do not skip required inputs, mandatory sections, or quality checks just to make the output shorter.
- Do not drift into out-of-scope work such as code implementation, design production, or unsupported legal conclusions.
Outputs
- An AI-focused strategy, audit, system design, or prompt asset in markdown with human review and control points.
References
- Use the inline instructions in this skill now. If a
references/ directory is added later, treat its files as the deeper source material and keep this SKILL.md execution-focused.
Required Input
Ask for the following before generating any deliverable:
- Client business name — trading name as it appears to customers
- Industry — e.g. retail, hospitality, healthcare, professional services
- Country/city — default: Uganda / East Africa if not specified
- All customer data sources currently in use — tick all that apply:
WhatsApp chat history | Facebook Page DMs and comments | Email (Gmail /
Outlook) | CRM software (name it) | Excel / Google Sheets | Accounting
software (e.g. QuickBooks, Wave) | Other (describe)
- Primary AI use case the data will support — select one:
Content personalisation | Chatbot knowledge base | Predictive analytics |
Audience segmentation
- Approximate number of customer records — total across all systems
(even a rough estimate is useful)
Why Data Quality is Step 1
Three independent books converge on the same conclusion: data quality is the
single most important prerequisite for any AI marketing investment (Venkatesan
and Lecinski, 2026; Lamplugh, 2024; Ltifi, 2025). AI tools amplify what they
receive — clean data produces better outputs; dirty data produces worse outputs
at speed.
For East African clients specifically, the data challenge is structural.
Customer data is fragmented across WhatsApp chat histories, Facebook Page DMs,
manual Excel spreadsheets, phone contacts, and verbal records. Before any AI
tool can add value, this fragmentation must be addressed.
The investment in data hygiene is not a technical exercise — it is the
commercial prerequisite for every AI use case. Skipping this step does not
save time; it wastes the cost of the AI tool.
The EA Data Reality
Map the typical Ugandan SME data landscape honestly before proceeding:
- WhatsApp — customer enquiries, orders, and complaints are stored in chat
history; rarely exported or structured; impossible to query or analyse at
scale
- Facebook Page — post comments, DM threads, and page insights are
exportable via Meta Business Suite but rarely exported; valuable for
sentiment and engagement data
- Email — enquiries and newsletter replies sit in Gmail or Outlook, rarely
in a CRM; no linkage to purchase data
- Excel spreadsheets — customer names, phone numbers, and purchase history
recorded manually; often incomplete, inconsistent formatting, and not
regularly updated
- Verbal / in-person — walk-in customers and cash transactions frequently
go unrecorded entirely
- Result — the same customer may appear in three or four separate systems
with different name spellings, no unique identifier, and no purchase history
linkage. AI tools cannot perform matching, personalisation, or segmentation
on this data without prior consolidation.
Data Hygiene Checklist (20 Items)
Complete this checklist for the client. Score each item Yes / No / Partial.
Flag any No or Partial item as a remediation priority.
Completeness
Accuracy
Consistency
Accessibility
Governance
Data Mapping Framework
Follow this step-by-step process to consolidate fragmented data into a single
master customer list:
- List all data sources — WhatsApp, Facebook, email, Excel, accounting
software, and any others identified in the Required Input.
- For each source, record: what data it contains; its format; estimated
record count; date last updated.
- Identify the primary identifier — the field that will link records
across systems. Phone number is the most reliable identifier in Uganda,
because it is more consistent than email across both urban and peri-urban
customers.
- Export all sources to Excel or CSV — request exports from WhatsApp
(chat export), Meta Business Suite (contacts and insights), Gmail
(contacts), and any CRM or accounting software in use.
- Standardise the phone number field across all exports — remove spaces,
remove hyphens, ensure all numbers use the +256 international format.
- Use VLOOKUP or a de-duplication tool to identify records appearing in
multiple sources. Google Sheets has a built-in Remove Duplicates function;
Excel has a Remove Duplicates tool under the Data tab.
- Build a master customer list — one row per customer, linking all data
points: name, phone number, email (if available), source systems, last
transaction date, and any segment tags.
30-Day Remediation Plan Template
Apply this plan after the hygiene checklist is complete and the top three
issues are identified:
Week 1 — Map and assess
Complete the data mapping framework. Run the hygiene checklist for all 20
items. Identify the top three issues (typically: missing contact details,
duplicate records, and inconsistent product naming). Assign a named owner for
each issue.
Week 2 — Fix completeness
Fill missing contact details by following up with customers via WhatsApp or
in-person. Add missing transaction dates from accounting records or receipts.
Ensure all product and service names follow a single agreed naming convention.
Week 3 — Fix accuracy
Standardise all phone numbers to +256 format. Correct misspellings in customer
names. Merge duplicate records into the master list. Flag and separate inactive
customers.
Week 4 — Implement governance
Assign the named data owner formally. Document the new record process: how new
customers are added, which fields are required, and who is responsible.
Confirm Uganda Data Protection and Privacy Act (2019) compliance — consent
records in place, retention policy documented. Export the final master customer
list and confirm it is ready for AI tool import.
Output: a clean master customer list ready for import into the AI tool of
choice, plus a documented data governance process that prevents regression.
Tool Options
| Tool | Use | EA accessibility | Approx. cost |
|---|
| Google Sheets | Data consolidation and de-duplication | Yes — free | Free |
| Airtable | Structured database with easy import/export | Yes — free tier | Free tier available |
| HubSpot Free CRM | Customer database with basic automation | Yes — free | Free |
| Notion | Flexible database and knowledge base | Yes — free tier | Free tier available |
| Excel | Standard spreadsheet de-duplication | Yes | Included in Office 365 |
For most Ugandan SMEs, Google Sheets is the recommended starting point: it is
free, collaborative, accessible on mobile, and sufficient for up to several
thousand customer records.
Handoff — Connecting Clean Data to AI Tools
Once the audit and remediation are complete, connect the master customer list
to the relevant AI tools:
- RAG knowledge base (see
ai-rag-brand-knowledge-base) — upload the
product catalogue and FAQs derived from the clean data
- Chatbot knowledge base (see
ai-whatsapp-chatbot-design) — upload
approved response templates and policies built from the structured data
- Segmentation tool — import the master customer list with segment tags
for audience targeting
- Predictive analytics tool (see
ai-predictive-analytics-social) —
import transaction history and engagement history for churn and upsell
modelling
Do not connect any AI tool until the Week 4 governance step is complete. A
tool connected to unclean data will produce unreliable outputs and undermine
client confidence in AI marketing.
Quality Criteria
Good output from this skill meets all of the following standards:
- All customer data sources mapped — no source overlooked, including informal
sources such as verbal records and WhatsApp chat histories
- Data hygiene checklist completed in full — all 20 items scored Yes, No, or
Partial with priority flags applied to every No and Partial item
- Data mapping framework completed — master customer list built using phone
number as the primary identifier, with all sources consolidated
- Top three hygiene issues identified with a specific, actionable remediation
step for each — not generic advice, but concrete next actions for this client
- 30-day remediation plan documented with named owner and weekly milestones —
not a suggestion but a committed plan
- Data governance framework in place — named data owner, new record process
documented, and responsibility assigned
- Uganda Data Protection and Privacy Act (2019) compliance confirmed — consent
documented for all records, data sharing restrictions understood, retention
policy set
- Clean data handed off to at least one AI tool with a verification step
confirming that outputs have improved compared to pre-audit baseline
References
- Venkatesan, R. and Lecinski, J. (2026) The AI Marketing Canvas, 2nd edn.
Stanford University Press.
- Lamplugh, M. (2024) The AI Marketing Playbook, 2nd edn. Mercury Learning.
- Ltifi, M. (ed.) (2025) Advances in Digital Marketing in the Era of
Artificial Intelligence. CRC Press.