Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

$pwd:

managing-databases

Name: Managing Databases
Author: rileyhilliard

// Guides database architecture decisions for PostgreSQL, DuckDB, Parquet, PGVector, and Neo4j. Use when designing schemas, choosing storage strategies, optimizing queries, tuning maintenance, configuring vector search, modeling graph data, or diagnosing performance issues across OLTP, OLAP, similarity search, and graph workloads.

Ejecutar en Manus

$ git log --oneline --stat

stars:122

forks:19

updated:10 de mayo de 2026, 07:55

Explorador de archivos

11 archivos

SKILL.md

readonly

name	managing-databases
description	Guides database architecture decisions for PostgreSQL, DuckDB, Parquet, PGVector, and Neo4j. Use when designing schemas, choosing storage strategies, optimizing queries, tuning maintenance, configuring vector search, modeling graph data, or diagnosing performance issues across OLTP, OLAP, similarity search, and graph workloads.

Database Management

Decision guidance for PostgreSQL, DuckDB, Parquet, and Neo4j in hybrid storage architectures.

When to use which database
PostgreSQL quick reference
DuckDB quick reference
Parquet quick reference
PGVector quick reference
Neo4j quick reference
Cross-database conventions
Performance debugging checklist

When to use which database

Workload	Use	Why
Transactional (CRUD, users, sessions)	PostgreSQL	ACID, row-level locking, indexes
Analytical (aggregations, scans)	DuckDB	Columnar, vectorized, parallel
Data storage/interchange	Parquet	Compressed, columnar, portable
Metadata + relationships	PostgreSQL	Foreign keys, constraints
Ad-hoc exploration	DuckDB	Fast on Parquet, no ETL needed
Time-series with point lookups	PostgreSQL + partitioning	Partition pruning + indexes
Time-series analytics	DuckDB on Parquet	Scan performance
Vector similarity search	PostgreSQL + PGVector	HNSW/IVFFlat indexes, hybrid search
RAG / semantic search	PostgreSQL + PGVector	Embeddings + metadata in same DB
Graph traversals / relationships	Neo4j	Native graph, index-free adjacency
Pattern matching / fraud detection	Neo4j	Multi-hop traversal, path finding
Knowledge graphs / ontologies	Neo4j	Flexible schema, relationship-first

Hybrid pattern example:

PostgreSQL: transactional data, relationships, users (metadata)
DuckDB + Parquet: analytical content, aggregations, time-series

PostgreSQL quick reference

Use for: Metadata, relationships, OLTP workloads, anything needing ACID.

Key decisions:

Partition tables >100M rows or with retention requirements
Index columns in WHERE/JOIN clauses, not everything
Tune autovacuum for high-churn tables

See references/postgres-architecture.md for maintenance patterns. See references/postgres-querying.md for advanced query techniques.

DuckDB quick reference

Use for: Analytics, aggregations, Parquet queries, data exploration.

Key decisions:

Prefer Parquet files over CSV (10-100x faster)
Let DuckDB auto-parallelize; don't micro-optimize
For remote data, increase threads beyond CPU count

See references/duckdb-architecture.md for storage and parallelism. See references/duckdb-querying.md for DuckDB-specific SQL features.

Parquet quick reference

Use for: Storing analytical data, data interchange, columnar compression.

Key decisions:

Target 128MB-1GB file sizes
Partition by low-to-moderate cardinality columns (date, region)
Sort by columns used in filters for better pruning

See references/parquet-architecture.md for file design. See references/parquet-querying.md for query optimization.

PGVector quick reference

Use for: Similarity search, RAG applications, semantic search, recommendations.

Key decisions:

HNSW for low-latency, high-recall (default choice)
IVFFlat for memory-constrained or batch-updated data
Use iterative scan for filtered queries
Consider hybrid search (vector + keyword) for 8-15% accuracy boost

See references/pgvector-architecture.md for index configuration. See references/pgvector-querying.md for hybrid search and filtering.

Neo4j quick reference

Use for: Graph traversals, relationship-heavy queries, pattern matching, knowledge graphs.

Key decisions:

Model around your queries, not your source data
Promote properties to nodes when you need to traverse through shared values
Use specific relationship types to avoid supernode bottlenecks
Bound all variable-length paths ([*1..5], never [*])
Use parameters in Cypher for execution plan caching

See references/neo4j-architecture.md for data modeling, indexing, and maintenance. See references/neo4j-querying.md for Cypher optimization and anti-patterns.

Performance debugging checklist

PostgreSQL slow query

Run EXPLAIN (ANALYZE, BUFFERS) on the query
Check for sequential scans on large tables
Verify indexes exist on filter/join columns
Check pg_stat_user_tables for bloat (dead tuples)
Review work_mem if seeing disk sorts

DuckDB slow query

Check if reading CSV instead of Parquet
Verify not doing SELECT * on remote data
Check thread count matches workload
Look for unnecessary type conversions

Parquet slow reads

Verify predicate pushdown is working (check query plan)
Check file sizes (too small = overhead, too large = no parallelism)
Confirm data is sorted by filter columns
Look for high-cardinality partition keys (too many small files)

PGVector slow search

Verify index exists and is being used (EXPLAIN)
Check ef_search (HNSW) or probes (IVFFlat) settings
Enable iterative scan for filtered queries
Check if IVFFlat recall degraded (rebuild index if heavily updated)
Consider partial indexes for common filters

Neo4j slow query

Run PROFILE on the query, read operators bottom-up
Look for AllNodesScan or NodeByLabelScan (missing index)
Check for CartesianProduct (disconnected MATCH patterns)
Verify parameters are used instead of literals (plan caching)
Check for unbounded variable-length paths
Monitor page_cache.hit_ratio (below 98% = need more page cache memory)

related-skills.json

mismo repositorio

structuring-articles.md

from "rileyhilliard/claude-essentials"

Selects and applies professional journalistic story structures (WSJ Formula, Inverted Pyramid, Hourglass, Tick-Tock, etc.) based on the content being written. Use when writing articles, blog posts, features, essays, long-form content, news stories, trend pieces, investigative reports, profiles, or any narrative prose longer than a few paragraphs. Also use when the user asks for help structuring a piece, choosing a story framework, organizing a draft, outlining an article, or wants to know which article format fits their content. Trigger on requests like "help me structure this," "what format should I use," "write a feature about," "draft a blog post on," or any mention of story structure, article architecture, or narrative frameworks. Complements the writer skill (which handles tone and anti-AI rhetoric) by providing the structural blueprint.

2026-05-27122

writer.md

from "rileyhilliard/claude-essentials"

Writing style and tone guide for human-sounding content. Use when writing documentation, READMEs, commit messages, PR descriptions, blog posts, LinkedIn posts, social media content, or any user-facing content.

2026-05-27122

architecting-systems.md

from "rileyhilliard/claude-essentials"

Guides clean, scalable system architecture during the build phase. Use when designing modules, defining boundaries, structuring projects, managing dependencies, or preventing tight coupling and brittleness as systems grow.

2026-05-10122

design.md

from "rileyhilliard/claude-essentials"

Enforces precise, minimal design for dashboards and admin interfaces. Use when building SaaS UIs, data-heavy interfaces, or any product needing Jony Ive-level craft.

2026-05-10122

executing-plans.md

from "rileyhilliard/claude-essentials"

Executes implementation plans with smart task grouping. Groups related tasks to share context, parallelizes across independent subsystems.

2026-05-10122

handling-errors.md

from "rileyhilliard/claude-essentials"

Prevents silent failures and context loss in error handling. Use when writing try-catch blocks, designing error propagation, reviewing catch blocks, or implementing Result patterns.

2026-05-10122

package.json

"author": "rileyhilliard"

"repository": "rileyhilliard/claude-essentials"

Abrir repositorio de GitHub Ver repositorios del creador

$ install --global

$ download --local

Ejecutar en Manus

$ useful --forSOC

Administradores de bases de datosOcupaciones informáticas y matemáticas15-1242L4

name	managing-databases
description	Guides database architecture decisions for PostgreSQL, DuckDB, Parquet, PGVector, and Neo4j. Use when designing schemas, choosing storage strategies, optimizing queries, tuning maintenance, configuring vector search, modeling graph data, or diagnosing performance issues across OLTP, OLAP, similarity search, and graph workloads.

Database Management

Decision guidance for PostgreSQL, DuckDB, Parquet, and Neo4j in hybrid storage architectures.

When to use which database
PostgreSQL quick reference
DuckDB quick reference
Parquet quick reference
PGVector quick reference
Neo4j quick reference
Cross-database conventions
Performance debugging checklist

When to use which database

Workload	Use	Why
Transactional (CRUD, users, sessions)	PostgreSQL	ACID, row-level locking, indexes
Analytical (aggregations, scans)	DuckDB	Columnar, vectorized, parallel
Data storage/interchange	Parquet	Compressed, columnar, portable
Metadata + relationships	PostgreSQL	Foreign keys, constraints
Ad-hoc exploration	DuckDB	Fast on Parquet, no ETL needed
Time-series with point lookups	PostgreSQL + partitioning	Partition pruning + indexes
Time-series analytics	DuckDB on Parquet	Scan performance
Vector similarity search	PostgreSQL + PGVector	HNSW/IVFFlat indexes, hybrid search
RAG / semantic search	PostgreSQL + PGVector	Embeddings + metadata in same DB
Graph traversals / relationships	Neo4j	Native graph, index-free adjacency
Pattern matching / fraud detection	Neo4j	Multi-hop traversal, path finding
Knowledge graphs / ontologies	Neo4j	Flexible schema, relationship-first

Hybrid pattern example:

PostgreSQL: transactional data, relationships, users (metadata)
DuckDB + Parquet: analytical content, aggregations, time-series

PostgreSQL quick reference

Use for: Metadata, relationships, OLTP workloads, anything needing ACID.

Key decisions:

Partition tables >100M rows or with retention requirements
Index columns in WHERE/JOIN clauses, not everything
Tune autovacuum for high-churn tables

See references/postgres-architecture.md for maintenance patterns. See references/postgres-querying.md for advanced query techniques.

DuckDB quick reference

Use for: Analytics, aggregations, Parquet queries, data exploration.

Key decisions:

Prefer Parquet files over CSV (10-100x faster)
Let DuckDB auto-parallelize; don't micro-optimize
For remote data, increase threads beyond CPU count

See references/duckdb-architecture.md for storage and parallelism. See references/duckdb-querying.md for DuckDB-specific SQL features.

Parquet quick reference

Use for: Storing analytical data, data interchange, columnar compression.

Key decisions:

Target 128MB-1GB file sizes
Partition by low-to-moderate cardinality columns (date, region)
Sort by columns used in filters for better pruning

See references/parquet-architecture.md for file design. See references/parquet-querying.md for query optimization.

PGVector quick reference

Use for: Similarity search, RAG applications, semantic search, recommendations.

Key decisions:

HNSW for low-latency, high-recall (default choice)
IVFFlat for memory-constrained or batch-updated data
Use iterative scan for filtered queries
Consider hybrid search (vector + keyword) for 8-15% accuracy boost

See references/pgvector-architecture.md for index configuration. See references/pgvector-querying.md for hybrid search and filtering.

Neo4j quick reference

Use for: Graph traversals, relationship-heavy queries, pattern matching, knowledge graphs.

Key decisions:

Model around your queries, not your source data
Promote properties to nodes when you need to traverse through shared values
Use specific relationship types to avoid supernode bottlenecks
Bound all variable-length paths ([*1..5], never [*])
Use parameters in Cypher for execution plan caching

See references/neo4j-architecture.md for data modeling, indexing, and maintenance. See references/neo4j-querying.md for Cypher optimization and anti-patterns.

Performance debugging checklist

PostgreSQL slow query

Run EXPLAIN (ANALYZE, BUFFERS) on the query
Check for sequential scans on large tables
Verify indexes exist on filter/join columns
Check pg_stat_user_tables for bloat (dead tuples)
Review work_mem if seeing disk sorts

DuckDB slow query

Check if reading CSV instead of Parquet
Verify not doing SELECT * on remote data
Check thread count matches workload
Look for unnecessary type conversions

Parquet slow reads

Verify predicate pushdown is working (check query plan)
Check file sizes (too small = overhead, too large = no parallelism)
Confirm data is sorted by filter columns
Look for high-cardinality partition keys (too many small files)

PGVector slow search

Verify index exists and is being used (EXPLAIN)
Check ef_search (HNSW) or probes (IVFFlat) settings
Enable iterative scan for filtered queries
Check if IVFFlat recall degraded (rebuild index if heavily updated)
Consider partial indexes for common filters

Neo4j slow query

Run PROFILE on the query, read operators bottom-up
Look for AllNodesScan or NodeByLabelScan (missing index)
Check for CartesianProduct (disconnected MATCH patterns)
Verify parameters are used instead of literals (plan caching)
Check for unbounded variable-length paths
Monitor page_cache.hit_ratio (below 98% = need more page cache memory)

managing-databases

Database Management

Contents

When to use which database

PostgreSQL quick reference

DuckDB quick reference

Parquet quick reference

PGVector quick reference

Neo4j quick reference

Performance debugging checklist

PostgreSQL slow query

DuckDB slow query

Parquet slow reads

PGVector slow search

Neo4j slow query

Database Management

Contents

When to use which database

PostgreSQL quick reference

DuckDB quick reference

Parquet quick reference

PGVector quick reference

Neo4j quick reference

Performance debugging checklist

PostgreSQL slow query

DuckDB slow query

Parquet slow reads

PGVector slow search

Neo4j slow query

managing-databases

Database Management

Contents

When to use which database

PostgreSQL quick reference

DuckDB quick reference

Parquet quick reference

PGVector quick reference

Neo4j quick reference

Performance debugging checklist

PostgreSQL slow query

DuckDB slow query

Parquet slow reads

PGVector slow search

Neo4j slow query

Más de este repositorio

Más de este repositorio

Database Management

Contents

When to use which database

PostgreSQL quick reference

DuckDB quick reference

Parquet quick reference

PGVector quick reference

Neo4j quick reference

Performance debugging checklist

PostgreSQL slow query

DuckDB slow query

Parquet slow reads

PGVector slow search

Neo4j slow query