一键在 Manus 中运行任何 Skill

dct-infer

星标0

分支0

更新时间2026年2月9日 02:47

Use this skill when the user wants to generate SQL CREATE TABLE statements from data files, infer schema from CSV/JSON/Parquet, create database schemas from existing data, or get column types from a file. Triggers include "generate schema", "create table from csv", "infer types", "what's the schema", "get column types", "sql ddl", or when preparing data for SQL databases like DuckDB, PostgreSQL, or similar.

安装

用 Codex 或 Claude 帮你安装复制这段 Prompt，粘贴到 Codex、Claude 或其他助手里，让它检查 Skill 页面并帮你完成安装。

在 Manus 中运行

来源

andrew-a-hale

andrew-a-hale/dct

打开 GitHub 仓库查看创作者相关仓库

下载

在 Manus 中运行

DCT Infer - Generate SQL Schema

Create DuckDB-compatible CREATE TABLE statements by analyzing data file contents.

When to Use

Use this skill when you need to:

Create database tables from existing data files
Document the schema of a dataset
Generate DDL for ETL pipelines
Understand column types in a file
Prepare data for SQL-based analysis

Installation

which dct || go build -o dct && chmod +x ./dct

Usage

dct infer <file> [flags]

Flags

-t, --table <name>: Table name (default: "default")
-n, --lines <number>: Number of lines to analyze for type inference (useful for large files)
-o, --output <file>: Output to file instead of stdout

Examples

Basic schema inference:

dct infer data.csv

With custom table name:

dct infer data.parquet -t events

Save schema to file:

dct infer large.ndjson -n 1000 -t users -o schema.sql

Infer from specific number of rows:

dct infer bigfile.csv -n 500 -t transactions

Output Format

DuckDB-compatible CREATE TABLE statement:

create table users (
    "id" bigint,
    "name" varchar,
    "email" varchar,
    "created_at" timestamp,
    "is_active" boolean
)

Supported Data Types

The inferred schema uses DuckDB types:

bigint - 64-bit integers
integer - 32-bit integers
double - Floating point numbers
varchar - String/text data
timestamp - Date and time
date - Date only
time - Time only
boolean - True/false values
array(...) - Array columns
row(...) - Struct/nested columns

Best Practices

Use -n flag for large files to speed up inference
Column names are quoted to handle special characters
Output is compatible with DuckDB and similar SQL databases
For Parquet files, types are read directly from metadata
For CSV/JSON, types are inferred from sample data

Integration Examples

With DuckDB

# Create table directly
dct infer data.csv -t my_table | duckdb mydb.duckdb

# Or save and execute
dct infer data.csv -t my_table -o schema.sql
duckdb mydb.duckdb < schema.sql

In Scripts

#!/bin/bash
for file in *.csv; do
    dct infer "$file" -t "$(basename "$file" .csv)" > "${file%.csv}.sql"
done

Related Skills

dct-peek: Preview data before inferring schema
dct-profile: Check data quality before creating tables

同仓库更多 Skills

同仓库

dct-chart

andrew-a-hale/dct

Use this skill when the user wants to visualize data distributions, create ASCII histograms, generate simple charts from CSV/JSON data, plot column values, or see value frequencies in terminal-friendly format. Triggers include "chart this data", "visualize distribution", "histogram of values", "plot the data", "ascii chart", "terminal visualization", or when needing quick visual analysis without external plotting tools.

2026-02-090

dct-diff

andrew-a-hale/dct

Use this skill when the user wants to compare two data files, find differences between datasets, validate data consistency, check if files have matching records, or reconcile data between sources. Triggers include "compare these files", "diff the datasets", "are these the same", "find differences", "validate data matches", "reconcile", "data comparison", or when doing data quality validation between two files.

2026-02-090

dct-flattify

andrew-a-hale/dct

Use this skill when the user wants to flatten nested JSON structures, convert nested objects to flat format, generate SQL queries from nested JSON, unnest hierarchical data, or work with nested API responses that need to be tabular. Triggers include "flatten this json", "make json flat", "nested to flat", "unnest json", "json to sql", "flatten nested", or when dealing with deeply nested JSON from APIs or document stores.

2026-02-090

dct-generate

andrew-a-hale/dct

Use this skill when the user wants to create synthetic test data, generate fake datasets, create mock data for testing, produce realistic data with specific patterns, or need sample data with custom schemas. Triggers include "generate test data", "create fake data", "mock dataset", "synthetic data", "generate sample records", "create test data", "fake users", "mock data", or when needing test data with specific fields and relationships.

2026-02-090

dct-js2sql

andrew-a-hale/dct

Use this skill when the user wants to convert JSON Schema to SQL CREATE TABLE statements, transform schema definitions to database DDL, create SQL tables from JSON Schema files, or generate database schemas from API specifications. Triggers include "json schema to sql", "convert schema to sql", "create table from json schema", "json schema ddl", "schema conversion", or when working with OpenAPI, JSON Schema, or API specifications that need database tables.

2026-02-090

dct-peek

andrew-a-hale/dct

Use this skill when the user wants to preview or inspect the contents of a data file (CSV, JSON, NDJSON, Parquet). Triggers include "show me the data", "preview this file", "what's in this csv", "look at the first rows", "sample the data", or when needing to understand data structure before processing. This is often the first step before other data operations.

2026-02-090

name

dct-infer

description

DCT Infer - Generate SQL Schema

Create DuckDB-compatible CREATE TABLE statements by analyzing data file contents.

When to Use

Use this skill when you need to:

Create database tables from existing data files
Document the schema of a dataset
Generate DDL for ETL pipelines
Understand column types in a file
Prepare data for SQL-based analysis

Installation

which dct || go build -o dct && chmod +x ./dct

Usage

dct infer <file> [flags]

Flags

-t, --table <name>: Table name (default: "default")
-n, --lines <number>: Number of lines to analyze for type inference (useful for large files)
-o, --output <file>: Output to file instead of stdout

Examples

Basic schema inference:

dct infer data.csv

With custom table name:

dct infer data.parquet -t events

Save schema to file:

dct infer large.ndjson -n 1000 -t users -o schema.sql

Infer from specific number of rows:

dct infer bigfile.csv -n 500 -t transactions

Output Format

DuckDB-compatible CREATE TABLE statement:

create table users (
    "id" bigint,
    "name" varchar,
    "email" varchar,
    "created_at" timestamp,
    "is_active" boolean
)

Supported Data Types

The inferred schema uses DuckDB types:

bigint - 64-bit integers
integer - 32-bit integers
double - Floating point numbers
varchar - String/text data
timestamp - Date and time
date - Date only
time - Time only
boolean - True/false values
array(...) - Array columns
row(...) - Struct/nested columns

Best Practices

Use -n flag for large files to speed up inference
Column names are quoted to handle special characters
Output is compatible with DuckDB and similar SQL databases
For Parquet files, types are read directly from metadata
For CSV/JSON, types are inferred from sample data

Integration Examples

With DuckDB

# Create table directly
dct infer data.csv -t my_table | duckdb mydb.duckdb

# Or save and execute
dct infer data.csv -t my_table -o schema.sql
duckdb mydb.duckdb < schema.sql

In Scripts

#!/bin/bash
for file in *.csv; do
    dct infer "$file" -t "$(basename "$file" .csv)" > "${file%.csv}.sql"
done

Related Skills

dct-peek: Preview data before inferring schema
dct-profile: Check data quality before creating tables