
importing-to-seekdb

Name: Importing To Seekdb
Author: oceanbase


stars:7

forks:2

updated:January 28, 2026 at 07:24


name	importing-to-seekdb
description	Import CSV or Excel files into seekdb vector database and manage collections. Supports automatic vectorization of specified columns using embedding functions. When users need to: (1) Read and preview Excel files, (2) Import CSV/Excel data into seekdb, (3) Create vector collections from tabular data, (4) Vectorize specific text columns for semantic search, (5) Batch insert product/document data with embeddings, (6) Delete collections, or (7) Access sample data files (sample_products.csv/xlsx) for testing - IMPORTANT: sample files are located in this skill's example-data/ directory, you MUST read this skill file first to get the correct path.
license	MIT

Import Data Files to seekdb

Read, preview, and import CSV or Excel files into the seekdb vector database, with optional column vectorization for semantic search. The skill can also delete collections.

Path Convention

Note: All paths in this document (e.g., scripts/, example-data/) are relative to THIS skill directory, not the project root.
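As a minimal sketch of this convention, the helper below anchors a relative path from this document onto the skill directory. The function name `resolve_skill_path` and the install location in `SKILL_DIR` are hypothetical and only illustrate the idea; the real skill directory depends on where the skill is installed.

```python
from pathlib import Path


def resolve_skill_path(skill_dir: str, relative: str) -> Path:
    """Anchor a path from this document onto the skill directory."""
    candidate = Path(relative)
    # Absolute paths are returned unchanged; only relative ones are anchored.
    if candidate.is_absolute():
        return candidate
    return Path(skill_dir) / candidate


# Hypothetical install location, for illustration only.
SKILL_DIR = "skills/importing-to-seekdb"
script_path = resolve_skill_path(SKILL_DIR, "scripts/import_to_seekdb.py")
sample_path = resolve_skill_path(SKILL_DIR, "example-data/sample_products.csv")
```

Resolving paths this way avoids looking for scripts/ or example-data/ under the project root by mistake.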

Prerequisites

Python 3.10+ installed
Required packages:

pip install pyseekdb pandas openpyxl
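Before running the scripts, it can help to confirm that the prerequisites are in place. The following is a sketch, not part of the skill: the helper names are hypothetical, and it assumes each package's import name matches its pip name.

```python
import importlib.util
import sys

# Assumption: the import names match the pip package names listed above.
REQUIRED_PACKAGES = ("pyseekdb", "pandas", "openpyxl")


def missing_packages(names=REQUIRED_PACKAGES):
    """Return the top-level package names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_is_supported(version_info=sys.version_info, minimum=(3, 10)):
    """Return True when the interpreter is at least Python 3.10."""
    return tuple(version_info[:2]) >= minimum


if not python_is_supported():
    print("Python 3.10 or newer is required")
for name in missing_packages():
    print(f"Missing package: {name} (install it with pip)")
```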

Sample Data

Sample data files are provided in the example-data/ directory:

File	Description
`sample_products.csv`	Sample product data in CSV format
`sample_products.xlsx`	Sample product data in Excel format

Quick Start

Use the provided scripts/import_to_seekdb.py script:

# Import with vectorization on Details column
python scripts/import_to_seekdb.py import example-data/sample_products.csv --vectorize-column Details

# Import without vectorization
python scripts/import_to_seekdb.py import example-data/sample_products.csv

# Import Excel with custom collection name
python scripts/import_to_seekdb.py import example-data/sample_products.xlsx -v Description -c my_products

# Delete a collection
python scripts/import_to_seekdb.py delete my_collection

Note: To list all collections, use query_from_seekdb.py list from the querying-from-seekdb skill.

Scripts

This skill provides the following scripts in the scripts/ directory:

Script	Description
`import_to_seekdb.py`	Main script with CLI interface for importing data and managing collections
`read_excel.py`	Read and preview Excel files with detailed information

Available Commands

import_to_seekdb.py

Command	Description
`import <file>`	Import CSV/Excel file to seekdb with optional vectorization
`delete <name>`	Delete a collection from seekdb

read_excel.py

Read and preview Excel files before importing:

# Basic preview (show file info and first 5 rows)
python scripts/read_excel.py example-data/sample_products.xlsx

# List all sheets
python scripts/read_excel.py example-data/sample_products.xlsx --list-sheets

# Preview specific sheet with more rows
python scripts/read_excel.py data.xlsx --sheet "Sheet2" --rows 20

# Show column information and statistics
python scripts/read_excel.py example-data/sample_products.xlsx --columns --stats

# Export to CSV
python scripts/read_excel.py example-data/sample_products.xlsx --to-csv output.csv

Option	Description
`--sheet, -s`	Sheet name to read (default: first sheet)
`--rows, -r`	Number of rows to preview (default: 5)
`--list-sheets, -l`	List all sheets and exit
`--columns, -c`	Show detailed column information
`--stats`	Show statistics for numeric columns
`--to-csv`	Export sheet to CSV file
`--all-rows, -a`	Display all rows

Workflow

The import_to_seekdb.py script automatically handles the following steps:

1. Read Data File - Supports CSV (.csv) and Excel (.xlsx, .xls) formats
2. Connect to seekdb - Uses embedded mode by default, or server mode when the connection environment variables are set
3. Create Collection - With optional vectorization using the default embedding function (all-MiniLM-L6-v2, 384 dimensions)
4. Import Data - Batch processing with a configurable batch size
5. Verify - Displays the record count and a data preview after import
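The first step, choosing a reader based on the file extension, might look roughly like the sketch below. This is not the script's actual code: `detect_format` and `load_table` are hypothetical names, and the use of `pandas.read_csv` and `pandas.read_excel` is an assumption based on the listed prerequisites.

```python
from pathlib import Path

CSV_SUFFIXES = {".csv"}
EXCEL_SUFFIXES = {".xlsx", ".xls"}


def detect_format(path: str) -> str:
    """Classify a data file as 'csv' or 'excel' from its extension."""
    suffix = Path(path).suffix.lower()
    if suffix in CSV_SUFFIXES:
        return "csv"
    if suffix in EXCEL_SUFFIXES:
        return "excel"
    raise ValueError(f"Unsupported file type: {suffix or path}")


def load_table(path: str):
    """Load the file into a pandas DataFrame (assumed reader choice)."""
    # Imported lazily so detect_format works even without pandas installed.
    import pandas as pd

    if detect_format(path) == "csv":
        return pd.read_csv(path)
    # read_excel reads the first sheet by default. Legacy .xls files
    # typically need the xlrd package in addition to openpyxl.
    return pd.read_excel(path)
```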

User Interaction Guide

For Reading Excel Files

When a user wants to preview or inspect an Excel file before importing:

# Preview file structure and data
python scripts/read_excel.py <file_path>

# With column details and statistics
python scripts/read_excel.py <file_path> --columns --stats

This helps users:

Understand the file structure (sheets, columns, row count)
Identify which column to vectorize
Check data quality before importing

For Data Import

When a user requests a data import, ask:

1. File path: "Please provide the path to your CSV or Excel file."
   - If the user needs sample data, use the files in the example-data/ directory
   - For Excel files, suggest previewing the file with read_excel.py first
2. Vectorization: "Would you like to enable vector search by vectorizing a column? (yes/no)"
3. Column selection (if yes): "Which column to vectorize? (e.g., 'Details', 'Description')"
4. Collection name: "Collection name? (default: derived from filename)"
5. Connection mode: "Embedded (local) or server mode?"
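Once the answers are collected, they map onto the command-line flags shown in Quick Start. The sketch below assembles the argument list. `build_import_command` is a hypothetical helper, not part of the skill, and it leaves out the connection mode because the environment variable names for server mode are not given in this document.

```python
from typing import List, Optional

SCRIPT = "scripts/import_to_seekdb.py"


def build_import_command(
    file_path: str,
    vectorize_column: Optional[str] = None,
    collection: Optional[str] = None,
    batch_size: Optional[int] = None,
) -> List[str]:
    """Translate the user's answers into an import command line."""
    cmd = ["python", SCRIPT, "import", file_path]
    if vectorize_column:
        # Only added when the user opts into vectorization.
        cmd += ["--vectorize-column", vectorize_column]
    if collection:
        # When omitted, the script derives the name from the filename.
        cmd += ["-c", collection]
    if batch_size is not None:
        cmd += ["--batch-size", str(batch_size)]
    return cmd


command = build_import_command(
    "example-data/sample_products.csv", vectorize_column="Details"
)
print(" ".join(command))
```

Returning a list rather than a single string means the result can be passed to `subprocess.run` directly, without shell quoting problems for paths that contain spaces.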

For Collection Management

List collections: Use query_from_seekdb.py list from the querying-from-seekdb skill
Delete collection: Run python scripts/import_to_seekdb.py delete <collection_name>

Embedding Functions

The script uses the default embedding function (all-MiniLM-L6-v2, 384 dimensions) when vectorization is enabled via --vectorize-column.
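To illustrate what the vectors are used for: semantic search compares the embedding of a query with the stored embeddings and ranks records by similarity. The sketch below uses cosine similarity, a common choice for sentence embeddings. Which distance metric seekdb actually uses is not stated here, so treat the metric as an assumption. The vectors are toy values, not real 384-dimensional embeddings.

```python
import math

EMBEDDING_DIM = 384  # dimension of all-MiniLM-L6-v2 vectors, per the text above


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy 3-dimensional vectors standing in for real embeddings.
query = [1.0, 0.0, 1.0]
close_match = [2.0, 0.0, 2.0]   # same direction as the query
unrelated = [0.0, 1.0, 0.0]     # orthogonal to the query
```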

Handling Large Files

The import_to_seekdb.py script imports data in batches, which matters most for files with more than 10,000 rows. You can set the batch size with --batch-size:

python scripts/import_to_seekdb.py import large_file.csv -v Details --batch-size 500
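Batch processing in general can be sketched as follows. This is an illustration of the technique, not the script's own implementation, and `iter_batches` is a hypothetical name.

```python
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def iter_batches(rows: Iterable[T], batch_size: int) -> Iterator[List[T]]:
    """Yield consecutive lists of at most batch_size rows."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch: List[T] = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # The final batch may be smaller than batch_size.
        yield batch


# With --batch-size 500, a 1,200-row file would be sent in three batches.
sizes = [len(batch) for batch in iter_batches(range(1200), 500)]
```

Smaller batches keep memory use and request sizes down at the cost of more round trips, while larger batches do the opposite.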

References

related-skills.json

same repository

seekdb-cli.md

from "oceanbase/seekdb-ecology-plugins"

Use seekdb-cli to interact with seekdb/OceanBase databases via shell commands. Use when: (1) querying databases with SQL, (2) exploring table schemas and structure, (3) profiling table data distributions, (4) inferring table relationships, (5) managing vector collections and semantic search, (6) adding/exporting collection data, (7) managing AI models, (8) checking database connection status, or (9) performing any database operation via command line.

2026-03-31

seekdb-docs.md

from "oceanbase/seekdb-ecology-plugins"

seekdb database documentation lookup. Use when users ask about seekdb features, SQL syntax, vector search, hybrid search, integrations, deployment, or any seekdb-related topics. Automatically locates relevant docs via catalog-based semantic search.

2026-03-26

querying-from-seekdb.md

from "oceanbase/seekdb-ecology-plugins"

Query and export data from seekdb vector database. Supports two search modes: (1) Scalar search - metadata filtering only, (2) Hybrid search - fulltext + semantic search combined. The --query-text parameter is used for BOTH fulltext ($contains) and semantic (query_texts) search simultaneously. Can export results to CSV/Excel.

2026-01-28

package.json

"author": "oceanbase"

"repository": "oceanbase/seekdb-ecology-plugins"

