Run any Skill in Manus with one click

Get Started

excel-conditional-filtering-optimization

根据多维数值条件筛选 Excel 数据并导出结果，支持大规模数据的自动性能优化处理。

Run Skill in Manus

Overview

根据多维数值条件筛选 Excel 数据并导出结果，支持大规模数据的自动性能优化处理。

Install command

npx skills add https://github.com/OpenSenseNova/SenseNova-Skills --skill excel-conditional-filtering-optimization

Copy and paste this command into Claude Code to install the skill

Source

OpenSenseNova/SenseNova-Skills

Stars3,284

Forks238

UpdatedApril 26, 2026 at 11:07

SKILL.md

readonly

name	excel-conditional-filtering-optimization
description	根据多维数值条件筛选 Excel 数据并导出结果，支持大规模数据的自动性能优化处理。

Excel_Conditional_Filtering_Optimization

Note: This sub-skill covers one step of the Excel analysis workflow. For the full pipeline (file reading, row counting, large-file optimization, export), see the parent workflow SKILL.md.

Step1 读取 Excel 文件中所有工作表的数据，统计各表行数并汇总，用于评估数据规模。

import pandas as pd

file_path = "input_data.xlsx"

# 读取所有 sheet，统计行数
xls = pd.ExcelFile(file_path)
print("Sheet names:", xls.sheet_names)

total_rows = 0
sheet_details = []
for sheet in xls.sheet_names:
    df_temp = pd.read_excel(file_path, sheet_name=sheet)
    row_count = len(df_temp)
    sheet_details.append({"sheet": sheet, "rows": row_count})
    total_rows += row_count

print(f"Sheet details: {sheet_details}")
print(f"Total rows across all sheets: {total_rows}")

Step2 对目标数据进行清洗，处理表头偏移，并将关键列转换为数值类型以确保计算准确。

# 读取目标数据表
target_sheet = 'Sheet1'
df = pd.read_excel(file_path, sheet_name=target_sheet, header=0)

# 处理可能的子表头或空行偏移（示例：跳过第一行）
# df = df.iloc[1:].reset_index(drop=True)

# 统一设置列名（根据实际业务逻辑调整占位符）
# df.columns = ['col_1', 'col_2', 'col_3', 'target_id', 'val_a', 'val_b', 'val_c']

# 强制转换数值列，处理非数值数据为 NaN
numeric_cols = ['val_a', 'val_b', 'val_c', 'target_id']
for col in numeric_cols:
    if col in df.columns:
        df[col] = pd.to_numeric(df[col], errors='coerce')

# 处理合并单元格（如有）
# df = df.ffill()

Step3 执行多维度条件筛选逻辑，提取符合特定数值特征的唯一记录。

# 筛选逻辑：例如 val_a, val_b, val_c 同时满足特定阈值（如均为 0）
mask = (df['val_a'] == 0) & (df['val_b'] == 0) & (df['val_c'] == 0)
filtered_df = df[mask][['target_id', 'val_a', 'val_b', 'val_c']]

# 提取唯一编号并去除空值
result = filtered_df.drop_duplicates().dropna(subset=['target_id']).reset_index(drop=True)

Step4 将筛选后的结果保存为新的 Excel 文件，并生成下载链接。

output_path = "filtered_analysis_result.xlsx"

# 格式化输出列名
result.columns = ['Target_Index', 'Value_A', 'Value_B', 'Value_C']

# 导出文件
result.to_excel(output_path, index=False)

# 打印结果摘要与下载路径
print(f"Filtered records count: {len(result)}")
print(f"Result saved to: {output_path}")

More from this repository

same repository

sn-ppt-creative

OpenSenseNova/SenseNova-Skills

Creative-mode PPT pipeline. One full-page 16:9 PNG per slide. LLM / VLM calls go through sn-ppt-standard/lib/model_client.py (shared thin client). Text-to-image (the actual png rendering) goes through sn-image-base/scripts/sn_agent_runner.py. Expects task_pack.json + info_pack.json already written by sn-ppt-entry.

2026-06-013.3k

sn-ppt-standard

OpenSenseNova/SenseNova-Skills

Standard-mode PPT pipeline. All LLM / VLM / T2I calls are wrapped in a single CLI entry (scripts/run_stage.py). The main agent's job is simple: emit ONE shell command per stage, never write loops, never write prompts.

2026-06-013.3k

sn-search-image

OpenSenseNova/SenseNova-Skills

USE FOR Google-backed image discovery via Serper.dev. Returns image URLs, page URLs, titles, and source domains.

2026-06-013.3k

sn-image-base

OpenSenseNova/SenseNova-Skills

Base-layer skill for the SenseNova-Skills project, providing low-level APIs for image generation, recognition (VLM), and text optimization (LLM). This skill does not preprocess inputs; it only calls backend services and returns results. This skill is not user-facing and is intended for upper-layer skills only.

2026-05-273.3k

sn-infographic

OpenSenseNova/SenseNova-Skills

Generates professional infographics with various layout types and visual styles. Analyzes content, recommends layout and style, and generates publication-ready infographics. Use when user asks to create "infographic", "信息图", "visual summary", or "可视化".

2026-05-273.3k

sn-ppt-doctor

OpenSenseNova/SenseNova-Skills

Environment diagnostic for the PPT family. Validates sn-image-base, API keys, Node runtime, and optional deps; interactively writes .env for required vars. Runs before sn-ppt-entry; does not modify sn-image-* skills.

2026-05-253.3k

Source

OpenSenseNova

OpenSenseNova/SenseNova-Skills

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Data ScientistsComputer and Mathematical Occupations15-2051L4

name	excel-conditional-filtering-optimization
description	根据多维数值条件筛选 Excel 数据并导出结果，支持大规模数据的自动性能优化处理。

Excel_Conditional_Filtering_Optimization

Note: This sub-skill covers one step of the Excel analysis workflow. For the full pipeline (file reading, row counting, large-file optimization, export), see the parent workflow SKILL.md.

Step1 读取 Excel 文件中所有工作表的数据，统计各表行数并汇总，用于评估数据规模。

import pandas as pd

file_path = "input_data.xlsx"

# 读取所有 sheet，统计行数
xls = pd.ExcelFile(file_path)
print("Sheet names:", xls.sheet_names)

total_rows = 0
sheet_details = []
for sheet in xls.sheet_names:
    df_temp = pd.read_excel(file_path, sheet_name=sheet)
    row_count = len(df_temp)
    sheet_details.append({"sheet": sheet, "rows": row_count})
    total_rows += row_count

print(f"Sheet details: {sheet_details}")
print(f"Total rows across all sheets: {total_rows}")

Step2 对目标数据进行清洗，处理表头偏移，并将关键列转换为数值类型以确保计算准确。

# 读取目标数据表
target_sheet = 'Sheet1'
df = pd.read_excel(file_path, sheet_name=target_sheet, header=0)

# 处理可能的子表头或空行偏移（示例：跳过第一行）
# df = df.iloc[1:].reset_index(drop=True)

# 统一设置列名（根据实际业务逻辑调整占位符）
# df.columns = ['col_1', 'col_2', 'col_3', 'target_id', 'val_a', 'val_b', 'val_c']

# 强制转换数值列，处理非数值数据为 NaN
numeric_cols = ['val_a', 'val_b', 'val_c', 'target_id']
for col in numeric_cols:
    if col in df.columns:
        df[col] = pd.to_numeric(df[col], errors='coerce')

# 处理合并单元格（如有）
# df = df.ffill()

Step3 执行多维度条件筛选逻辑，提取符合特定数值特征的唯一记录。

# 筛选逻辑：例如 val_a, val_b, val_c 同时满足特定阈值（如均为 0）
mask = (df['val_a'] == 0) & (df['val_b'] == 0) & (df['val_c'] == 0)
filtered_df = df[mask][['target_id', 'val_a', 'val_b', 'val_c']]

# 提取唯一编号并去除空值
result = filtered_df.drop_duplicates().dropna(subset=['target_id']).reset_index(drop=True)

Step4 将筛选后的结果保存为新的 Excel 文件，并生成下载链接。

output_path = "filtered_analysis_result.xlsx"

# 格式化输出列名
result.columns = ['Target_Index', 'Value_A', 'Value_B', 'Value_C']

# 导出文件
result.to_excel(output_path, index=False)

# 打印结果摘要与下载路径
print(f"Filtered records count: {len(result)}")
print(f"Result saved to: {output_path}")