azure-ml-dataset-creator

Name: Azure Ml Dataset Creator
Author: kimtth

// Generate synthetic and simulated datasets for evaluation and fine-tuning using Azure AI Foundry simulators. Create non-adversarial task data, adversarial safety data, and conversation datasets without manual data collection.


stars:1

forks:0

updated: 2025-12-11 01:48


Same repository

azure-ml-llm-trainer.md

from "kimtth/azure-ml-finetuning-eval-skills"

Train or fine-tune LLMs on Azure ML managed compute with TRL trainers. Uses direct trainer loops (SFT, DPO, RL) without relying on serverless APIs or Hugging Face infrastructure.

2025-12-11

azure-ml-model-evaluation.md

from "kimtth/azure-ml-finetuning-eval-skills"

Evaluate generative AI applications and models locally or in the cloud using Azure AI Evaluation SDK. Measure quality, safety, and performance with built-in and custom evaluators.

2025-12-11

package.json

"author": "kimtth"

"repository": "kimtth/azure-ml-finetuning-eval-skills"

name	azure-ml-dataset-creator
description	Generate synthetic and simulated datasets for evaluation and fine-tuning using Azure AI Foundry simulators. Create non-adversarial task data, adversarial safety data, and conversation datasets without manual data collection.
license	See repository root

Azure ML Dataset Creator

Generate synthetic datasets for evaluation and fine-tuning with Azure AI Foundry simulators, replacing manual data collection with automated simulation.

Two simulator types:

Simulator: non-adversarial, task-specific conversations generated from text, indexes, or custom prompts
AdversarialSimulator: safety evaluation datasets with jailbreak attacks and harmful content

Use this skill when:

Building evaluation or training datasets without production data
Testing application responses to varied user queries
Red-teaming for safety evaluation
Creating multi-turn conversation datasets
Generating synthetic data cost-effectively

Prerequisites

Azure AI Foundry hub-based project (not the newer Foundry project type)
Azure OpenAI deployment (GPT-5-mini recommended for cost)
Packages: azure-ai-evaluation, azure-identity
For adversarial: Project in East US 2, France Central, UK South, or Sweden Central

Template Files

The examples/ directory contains templates. Copy and adapt them for your project:

examples/
  ├── generate_qa_from_text.py              # Template: Q&A from Wikipedia/documents
  ├── generate_conversation.py               # Template: Multi-turn conversations
  ├── generate_adversarial.py                # Template: Safety evaluation datasets
  ├── generate_jailbreak_attacks.py          # Template: UPIA/XPIA attack simulation
  ├── generate_with_custom_prompty.py        # Template: Custom simulator behavior
  ├── utils.py                               # Template: Utility functions
  └── custom_simulator_prompty/
      ├── user_override.prompty              # Template: Custom user behavior
      └── query_generator.prompty            # Template: Custom Q&A generation

Do NOT reference these files directly. Copy and adapt them for your project structure.

Quick Start

Generate Q&A from Text

Copy examples/generate_qa_from_text.py and examples/utils.py to your project
Run: python generate_qa_from_text.py
Outputs: training_data.jsonl in chat completion format
- Extracts text from Wikipedia
- Generates Q&A with multiple personas
- Ready for SFT fine-tuning

Generate Multi-Turn Conversations

Copy examples/generate_conversation.py and examples/utils.py to your project
Run: python generate_conversation.py
Outputs: conversation_data.jsonl
- Predefined conversation starters
- Multi-turn dialogue (up to 5 turns)
- User simulator with configurable behavior

Generate Safety Evaluation Data

Copy examples/generate_adversarial.py and examples/utils.py to your project
Run: python generate_adversarial.py
Outputs: adversarial_qa.jsonl, adversarial_conversation.jsonl, adversarial_summarization.jsonl
- Tests responses to harmful/unsafe prompts
- Covers: hate, sexual, violence, self-harm
- Designed for safety evaluator benchmarking

Generate Jailbreak Attacks

Copy examples/generate_jailbreak_attacks.py and examples/utils.py to your project
Run: python generate_jailbreak_attacks.py
Outputs: direct_attack_baseline.jsonl, direct_attack_jailbreak.jsonl, indirect_attack.jsonl
- UPIA: Direct user prompt injection
- XPIA: Context/document injection
- Baseline + attack variants for comparison

Custom Simulator Behavior

Copy examples/generate_with_custom_prompty.py, examples/utils.py, and examples/custom_simulator_prompty/ to your project
Run: python generate_with_custom_prompty.py
Outputs: custom_prompty_data.jsonl
- Override user mood/persona (e.g., "professional")
- Control response diversity (temperature, top_p)
- Custom query-response generation logic

Data Formats

Chat Completion (for SFT fine-tuning)

{
  "messages": [
    {"role": "system", "content": "You are a helpful assistant"},
    {"role": "user", "content": "What is Azure ML?"},
    {"role": "assistant", "content": "Azure Machine Learning is..."}
  ]
}
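Because generated samples should be reviewed before use, a quick structural check on each JSONL line can catch malformed records early. The following is a minimal sketch using only the standard library; `validate_chat_record` and `ALLOWED_ROLES` are hypothetical names, not part of the skill's templates, and the rules are assumptions about what a well-formed SFT record looks like.

```python
import json

# Roles seen in the chat completion example above (assumed to be the full set).
ALLOWED_ROLES = {"system", "user", "assistant"}


def validate_chat_record(line: str) -> list[str]:
    """Return a list of problems found in one JSONL line (empty means OK)."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(record, dict):
        return ["record is not a JSON object"]

    messages = record.get("messages")
    if not isinstance(messages, list) or not messages:
        return ["'messages' must be a non-empty list"]

    problems = []
    for i, msg in enumerate(messages):
        if not isinstance(msg, dict):
            problems.append(f"message {i} is not an object")
            continue
        if msg.get("role") not in ALLOWED_ROLES:
            problems.append(f"message {i} has unexpected role {msg.get('role')!r}")
        content = msg.get("content")
        if not isinstance(content, str) or not content.strip():
            problems.append(f"message {i} has empty or non-string content")

    # For SFT, the final turn is normally the assistant's answer.
    last = messages[-1]
    if isinstance(last, dict) and last.get("role") != "assistant":
        problems.append("last message should come from the assistant")
    return problems
```

Run it over every line of training_data.jsonl and inspect any record that returns a non-empty list.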

Q&A Format (for evaluation)

{"query": "What is Azure ML?", "response": "Azure Machine Learning is..."}

See examples/generate_qa_from_text.py for output conversion patterns.
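As an illustration of how the two formats relate, the sketch below converts one Q&A record into a chat completion record. It is not the code from examples/generate_qa_from_text.py; `qa_to_chat` is a hypothetical helper, and the default system prompt is simply the one shown in the example above.

```python
import json


def qa_to_chat(qa_line: str, system_prompt: str = "You are a helpful assistant") -> str:
    """Convert one Q&A JSONL line into one chat completion JSONL line."""
    qa = json.loads(qa_line)
    record = {
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": qa["query"]},
            {"role": "assistant", "content": qa["response"]},
        ]
    }
    # ensure_ascii=False keeps non-English text readable in the output file.
    return json.dumps(record, ensure_ascii=False)
```

Applying this line by line turns an evaluation file into an SFT-ready file without loading the whole dataset into memory.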

Adversarial Scenarios

Scenario	Enum	Max Samples	Content Types
Q&A	ADVERSARIAL_QA	1,384	Hate, sexual, violence, self-harm
Conversation	ADVERSARIAL_CONVERSATION	1,018	Hate, sexual, violence, self-harm
Summarization	ADVERSARIAL_SUMMARIZATION	525	Hate, sexual, violence, self-harm
Search	ADVERSARIAL_SEARCH	1,000	Hate, sexual, violence, self-harm
Rewrite	ADVERSARIAL_REWRITE	1,000	Hate, sexual, violence, self-harm
Ungrounded Content	ADVERSARIAL_CONTENT_GEN_UNGROUNDED	496	Hate, sexual, violence, self-harm
Grounded Content	ADVERSARIAL_CONTENT_GEN_GROUNDED	475	All + jailbreak
Protected Material	ADVERSARIAL_PROTECTED_MATERIAL	306	Copyright detection
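Since each scenario has a maximum sample count, it can help to cap the requested number before calling the simulator. This is a small sketch that copies the limits from the table above; `MAX_SAMPLES` and `clamp_sample_count` are hypothetical names, and the limits may change between SDK releases.

```python
# Maximum samples per scenario, copied from the table above.
MAX_SAMPLES = {
    "ADVERSARIAL_QA": 1384,
    "ADVERSARIAL_CONVERSATION": 1018,
    "ADVERSARIAL_SUMMARIZATION": 525,
    "ADVERSARIAL_SEARCH": 1000,
    "ADVERSARIAL_REWRITE": 1000,
    "ADVERSARIAL_CONTENT_GEN_UNGROUNDED": 496,
    "ADVERSARIAL_CONTENT_GEN_GROUNDED": 475,
    "ADVERSARIAL_PROTECTED_MATERIAL": 306,
}


def clamp_sample_count(scenario: str, requested: int) -> int:
    """Return the requested count, capped at the scenario's documented maximum."""
    if scenario not in MAX_SAMPLES:
        raise KeyError(f"unknown scenario: {scenario}")
    if requested < 1:
        raise ValueError("requested sample count must be at least 1")
    return min(requested, MAX_SAMPLES[scenario])
```

For example, asking for 2,000 Q&A samples would be reduced to 1,384, while a request for 100 is passed through unchanged.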

Integration with Training

Generated JSONL files can be uploaded to Azure ML for fine-tuning. Use azureml:// URI paths with the azure-ml-llm-trainer skill for SFT, DPO, or RL training.

See examples/generate_qa_from_text.py for Azure ML data asset creation patterns.
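To make the URI form concrete, here is a sketch that builds a short-form datastore URI for an uploaded file. The `datastore_uri` helper is hypothetical, and the datastore name and path in the usage note are placeholders; the layout assumes the `azureml://datastores/<name>/paths/<path>` form, so verify it against your workspace before relying on it.

```python
def datastore_uri(datastore: str, relative_path: str) -> str:
    """Build a short-form azureml:// URI for a file stored in a datastore."""
    # Normalize Windows separators and drop any leading slash.
    clean_path = relative_path.replace("\\", "/").lstrip("/")
    if not datastore or not clean_path:
        raise ValueError("datastore and relative_path must both be non-empty")
    return f"azureml://datastores/{datastore}/paths/{clean_path}"
```

Calling `datastore_uri("workspaceblobstore", "datasets/training_data.jsonl")` yields a string that can be passed as a job input path, assuming a datastore with that name exists in the workspace.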

Customization

User Simulator Parameters

Control response diversity and behavior with simulator kwargs. See examples/generate_with_custom_prompty.py for implementation.

Multi-Language Support

Adversarial simulators support multiple languages: Spanish, Italian, French, Japanese, Portuguese, Chinese (Simplified), German. Check example files for language parameter usage.

Callback Pattern

The target application must be defined as an async callback that accepts a messages dict and optional parameters. See examples/generate_qa_from_text.py or examples/generate_conversation.py for callback implementation patterns.
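The shape of such a callback can be sketched as follows. This is an illustration, not the code from the example files: `my_app` is a hypothetical stand-in for the application under test, and the parameter names (`stream`, `session_state`, `context`) and returned keys follow the pattern commonly shown for the Azure AI Evaluation simulators, which should be checked against the installed SDK version.

```python
import asyncio
from typing import Any, Dict, List, Optional


def my_app(query: str) -> str:
    """Hypothetical stand-in for the application under test."""
    return f"Echo: {query}"


async def callback(
    messages: Dict[str, List[Dict[str, Any]]],
    stream: bool = False,
    session_state: Any = None,
    context: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    # The simulator's latest turn is the last entry in the message list.
    latest = messages["messages"][-1]
    answer = my_app(latest["content"])

    # Append the application's reply so the simulator can continue the dialogue.
    messages["messages"].append(
        {"role": "assistant", "content": answer, "context": context}
    )
    return {
        "messages": messages["messages"],
        "stream": stream,
        "session_state": session_state,
        "context": context,
    }
```

The callback can be exercised locally with `asyncio.run(callback({"messages": [{"role": "user", "content": "What is Azure ML?"}]}))` before wiring it to a simulator.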

Notes

Synthetic data validation: Always review generated samples before production use
Token costs: Monitor Azure OpenAI quota; use GPT-5-mini for cost efficiency
Context limits: Keep text inputs under 5,000 characters for optimal results
Reproducibility: Set randomization_seed for consistent results across runs
Regional availability: Adversarial simulators require supported regions (see Prerequisites)
Ethical use: Adversarial scenarios for testing/evaluation only; not for malicious use
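To respect the context limit noted above, long source documents can be split before they are passed to the simulator. The sketch below is one possible approach, not part of the skill's templates; `chunk_text` is a hypothetical helper that prefers paragraph boundaries and falls back to a hard split for oversized paragraphs.

```python
def chunk_text(text: str, max_chars: int = 5000) -> list[str]:
    """Split text into chunks of at most max_chars, preferring paragraph breaks."""
    chunks: list[str] = []
    current = ""
    for paragraph in text.split("\n\n"):
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Hard-split any single paragraph that exceeds the limit on its own.
        while len(paragraph) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(paragraph[:max_chars])
            paragraph = paragraph[max_chars:]
        # Try to pack the paragraph into the chunk being built.
        candidate = f"{current}\n\n{paragraph}" if current else paragraph
        if len(candidate) <= max_chars:
            current = candidate
        else:
            chunks.append(current)
            current = paragraph
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be used as a separate text input, which also spreads generated questions across the whole document.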

Common Patterns

Fine-Tuning Dataset: See examples/generate_qa_from_text.py for SFT data generation from text
Safety Benchmarking: See examples/generate_adversarial.py and examples/generate_jailbreak_attacks.py for multi-scenario safety data
Multi-Turn Evaluation: See examples/generate_conversation.py for conversation dataset generation
