Exécutez n'importe quel Skill dans Manus
en un clic

Exécutez n'importe quel Skill dans Manus en un clic

$pwd:

corvus-ecma-regex

Name: Corvus Ecma Regex
Author: corvus-dotnet

// Translate ECMAScript 262 /u mode regular expressions to .NET regex patterns. Covers semantic differences between ECMAScript and .NET regex engines, supplementary code point handling via surrogate pairs, Unicode property escapes, backreference conditionals, character class strategies, and strict /u mode validation. USE FOR: understanding regex translation in generated validation code, debugging pattern matching issues in JSON Schema pattern/patternProperties, extending regex support for new Unicode properties or constructs. DO NOT USE FOR: general .NET regex usage, writing custom regex patterns.

Exécuter dans Manus

$ git log --oneline --stat

stars:184

forks:20

updated:12 mai 2026 à 16:43

SKILL.md

readonly

name

corvus-ecma-regex

description

Translate ECMAScript 262 /u mode regular expressions to .NET regex patterns. Covers semantic differences between ECMAScript and .NET regex engines, supplementary code point handling via surrogate pairs, Unicode property escapes, backreference conditionals, character class strategies, and strict /u mode validation. USE FOR: understanding regex translation in generated validation code, debugging pattern matching issues in JSON Schema pattern/patternProperties, extending regex support for new Unicode properties or constructs. DO NOT USE FOR: general .NET regex usage, writing custom regex patterns.

ECMAScript Regex Translation

Entry Point API

The translator is in src/Corvus.Text.Json.CodeGeneration/EcmaRegexTranslator.cs:

// Translate an ECMAScript /u mode pattern to .NET
string dotnetPattern = EcmaRegexTranslator.Translate(@"\d+\.\d+");

// Non-throwing variant with span output
OperationStatus status = EcmaRegexTranslator.TryTranslate(
    ecmaPattern, buffer, out int charsWritten);

// Safe fallback — returns original pattern if translation fails
string pattern = EcmaRegexTranslator.TranslateOrFallback(ecmaPattern);

Translation Examples

ECMAScript Pattern	.NET Translation	Reason
`.`	`[^\n\r\u2028\u2029]`	Dot excludes specific line terminators
`\d`	`[0-9]`	ASCII digits only
`\u{1F600}`	`(?:\uD83D\uDE00)`	Supplementary char → surrogate pair
`(a)\1`	`(a)(?(1)\1)`	Backreference wrapped in conditional
`[a\D]`	`(?:[a]\|[^0-9])`	Negated shorthand in class uses alternation

Why Translation Is Needed

JSON Schema's pattern keyword uses ECMAScript regex semantics (ECMA 262 /u mode). .NET's System.Text.RegularExpressions has different semantics for several constructs. The translator converts ECMAScript patterns to equivalent .NET patterns.

Key Translation Rules

Character Class Shorthands

ECMAScript	.NET Translation	Reason
`\d`	`[0-9]`	.NET `\d` matches Unicode digits; ECMAScript only ASCII
`\w`	`[a-zA-Z0-9_]`	.NET `\w` matches Unicode word chars
`\s`	Explicit ECMAScript whitespace set	Different whitespace sets
`.`	`[^\n\r\u2028\u2029]`	ECMAScript excludes 4 line terminators

Word Boundaries

\b → explicit lookaround assertions using ASCII word characters only (ECMAScript definition).

Supplementary Code Points

\u{XXXXX} (code points above U+FFFF) → (?:\uHHHH\uLLLL) surrogate pair in .NET.

Backreferences

ECMAScript treats non-participating groups as always-matching. Translated to: (?(N)\N) — .NET conditional syntax that checks if group N participated.

Unicode Property Escapes

\p{Script=Latin} → expanded character class ranges (34 BMP + 5 supplementary ranges).

Binary properties like \p{Emoji} → equivalent .NET character class unions.

Performance Characteristics

The translator is zero-allocation using:

ref struct translator type
stackalloc for small intermediate buffers
ArrayPool<char> for larger buffers

The translated regex pattern is then compiled using RegexOptions.Compiled for repeated use in validation.

Regex Pattern Classification (at code-gen time)

Before translating, the code generator classifies patterns:

Classification	Example	Optimization
`Noop`	`.`, `^.$`	Skip validation entirely
`NonEmpty`	`.+`	Simple length > 0 check
`Prefix`	`^foo`	`StartsWith("foo")`
`Range`	`[a-z]`	Inline character range check
`FullRegex`	Everything else	Full compiled regex

Cross-References

For the evaluator that uses translated patterns, see corvus-standalone-evaluator
For the keyword that drives pattern validation, see corvus-keywords-and-validation
Full reference: docs/EcmaRegexTranslator.md, docs/EcmaRegexTranslations.md

related-skills.json

même dépôt

corvus-analyzers.md

from "corvus-dotnet/Corvus.JsonSchema"

Understand and work with the Roslyn analyzers shipped with Corvus.Text.Json. Covers 10 production diagnostics (CTJ001-CTJ010) for correct and performant V5 code, the CTJ-NAV refactoring for navigating from types to JSON Schema definitions, and the analyzer packaging convention. USE FOR: understanding what each analyzer checks, writing code that passes analyzer checks, packaging analyzer DLLs. DO NOT USE FOR: the 25 migration analyzers (use corvus-v4-migration for the workflow and CVJ001-CVJ025 reference).

2026-05-12184

corvus-benchmarks.md

from "corvus-dotnet/Corvus.JsonSchema"

Run, interpret, and maintain BenchmarkDotNet benchmarks for JSON Schema validation and query languages. Covers the B/ (frozen baseline) vs C/ (current) directory convention, stale Job-* cleanup, --buildTimeout, result file polling, regenerating C/ models after codegen changes, and JSONata/JMESPath/JsonLogic/JSONPath benchmarks. USE FOR: running benchmarks, interpreting results, regenerating benchmark models, troubleshooting BDN issues, adding new benchmark schemas. DO NOT USE FOR: general .NET performance analysis (use the analyzing-dotnet-performance skill).

2026-05-12184

corvus-bowtie-testing.md

from "corvus-dotnet/Corvus.JsonSchema"

Test Corvus.JsonSchema against the JSON Schema Test Suite using Bowtie, the cross-implementation meta-validator. Covers local package building, configuring a local Bowtie checkout to use locally-built packages, running the test suite via Docker/Podman containers, interpreting results, the iteration loop, and teardown. Both V4 (dotnet-corvus-jsonschema-v4engine) and V5 (dotnet-corvus-jsonschema-v5engine) implementations are supported. USE FOR: running Bowtie conformance suites against local changes, setting up the local development loop, interpreting Bowtie failure reports, testing schema dialect compliance (Draft 4 through 2020-12). DO NOT USE FOR: running the in-repo MSTest test suite (use corvus-build-and-test), regenerating test classes from the submodule (use corvus-test-suite-regeneration).

2026-05-12184

corvus-buffer-and-pooling.md

from "corvus-dotnet/Corvus.JsonSchema"

Write allocation-efficient buffer code in Corvus.JsonSchema using the codebase's established three-tier pooling pattern: stackalloc → ArrayPool → ThreadStatic caches. Covers threshold constants, the rent/return pattern, UTF-8-first processing, thread-local writer and workspace caches, and PooledByteBufferWriter. USE FOR: writing any code that needs temporary byte/char buffers, adding new pooled caches, working with UTF-8 data, avoiding heap allocation on hot paths. DO NOT USE FOR: choosing which ref-struct collection to use (use corvus-low-alloc-data-structures), document model internals (use corvus-parsed-documents-and-memory).

2026-05-12184

corvus-build-and-test.md

from "corvus-dotnet/Corvus.JsonSchema"

Build, test, and run the Corvus.JsonSchema solution correctly. Covers multi-targeting (net9.0/net10.0/net481/netstandard2.0), mandatory test category filters, solution file selection, running specific test classes or methods, writing new tests, and diagnosing common build/test failures. USE FOR: building the solution, running tests, writing new test files, diagnosing test failures, understanding TFM targeting, finding the right test project for a feature area. DO NOT USE FOR: benchmark execution (use corvus-benchmarks), code generation (use corvus-codegen), test suite regeneration (use corvus-test-suite-regeneration).

2026-05-12184

corvus-codegen.md

from "corvus-dotnet/Corvus.JsonSchema"

Generate strongly-typed C# from JSON Schema using the Roslyn source generator or the corvusjson CLI tool. Covers the JsonSchemaTypeGenerator attribute, CLI options, naming heuristics, AdditionalFiles registration, config file format, MSBuild properties, and troubleshooting generated output. USE FOR: generating types from schemas, configuring the source generator or CLI tool, understanding naming heuristics, inspecting generated output, troubleshooting generation issues. DO NOT USE FOR: modifying the generator internals (use corvus-keywords-and-validation), running benchmarks (use corvus-benchmarks).

2026-05-12184

package.json

"author": "corvus-dotnet"

"repository": "corvus-dotnet/Corvus.JsonSchema"

Ouvrir le dépôt GitHub Voir les dépôts du créateur

$ install --global

$ download --local

Exécuter dans Manus

$ useful --forSOC

Développeurs de logicielsProfessions informatiques et mathématiques15-1252L4

name

corvus-ecma-regex

description

ECMAScript Regex Translation

Entry Point API

The translator is in src/Corvus.Text.Json.CodeGeneration/EcmaRegexTranslator.cs:

// Translate an ECMAScript /u mode pattern to .NET
string dotnetPattern = EcmaRegexTranslator.Translate(@"\d+\.\d+");

// Non-throwing variant with span output
OperationStatus status = EcmaRegexTranslator.TryTranslate(
    ecmaPattern, buffer, out int charsWritten);

// Safe fallback — returns original pattern if translation fails
string pattern = EcmaRegexTranslator.TranslateOrFallback(ecmaPattern);

Translation Examples

ECMAScript Pattern	.NET Translation	Reason
`.`	`[^\n\r\u2028\u2029]`	Dot excludes specific line terminators
`\d`	`[0-9]`	ASCII digits only
`\u{1F600}`	`(?:\uD83D\uDE00)`	Supplementary char → surrogate pair
`(a)\1`	`(a)(?(1)\1)`	Backreference wrapped in conditional
`[a\D]`	`(?:[a]\|[^0-9])`	Negated shorthand in class uses alternation

Why Translation Is Needed

Key Translation Rules

Character Class Shorthands

ECMAScript	.NET Translation	Reason
`\d`	`[0-9]`	.NET `\d` matches Unicode digits; ECMAScript only ASCII
`\w`	`[a-zA-Z0-9_]`	.NET `\w` matches Unicode word chars
`\s`	Explicit ECMAScript whitespace set	Different whitespace sets
`.`	`[^\n\r\u2028\u2029]`	ECMAScript excludes 4 line terminators

Word Boundaries

\b → explicit lookaround assertions using ASCII word characters only (ECMAScript definition).

Supplementary Code Points

\u{XXXXX} (code points above U+FFFF) → (?:\uHHHH\uLLLL) surrogate pair in .NET.

Backreferences

ECMAScript treats non-participating groups as always-matching. Translated to: (?(N)\N) — .NET conditional syntax that checks if group N participated.

Unicode Property Escapes

\p{Script=Latin} → expanded character class ranges (34 BMP + 5 supplementary ranges).

Binary properties like \p{Emoji} → equivalent .NET character class unions.

Performance Characteristics

The translator is zero-allocation using:

ref struct translator type
stackalloc for small intermediate buffers
ArrayPool<char> for larger buffers

The translated regex pattern is then compiled using RegexOptions.Compiled for repeated use in validation.

Regex Pattern Classification (at code-gen time)

Before translating, the code generator classifies patterns:

Classification	Example	Optimization
`Noop`	`.`, `^.$`	Skip validation entirely
`NonEmpty`	`.+`	Simple length > 0 check
`Prefix`	`^foo`	`StartsWith("foo")`
`Range`	`[a-z]`	Inline character range check
`FullRegex`	Everything else	Full compiled regex

Cross-References

For the evaluator that uses translated patterns, see corvus-standalone-evaluator
For the keyword that drives pattern validation, see corvus-keywords-and-validation
Full reference: docs/EcmaRegexTranslator.md, docs/EcmaRegexTranslations.md

corvus-ecma-regex

ECMAScript Regex Translation

Entry Point API

Translation Examples

Why Translation Is Needed

Key Translation Rules

Character Class Shorthands

Word Boundaries

Supplementary Code Points

Backreferences

Unicode Property Escapes

Performance Characteristics

Regex Pattern Classification (at code-gen time)

Cross-References

Plus depuis ce dépôt

Plus depuis ce dépôt

ECMAScript Regex Translation

Entry Point API

Translation Examples

Why Translation Is Needed

Key Translation Rules

Character Class Shorthands

Word Boundaries

Supplementary Code Points

Backreferences

Unicode Property Escapes

Performance Characteristics

Regex Pattern Classification (at code-gen time)

Cross-References