원클릭으로
visual-architect
// Transform research papers into professional visual schemas. Analyzes paper logic, selects optimal layout patterns, and generates detailed prompts for AI image generation.
// Transform research papers into professional visual schemas. Analyzes paper logic, selects optimal layout patterns, and generates detailed prompts for AI image generation.
| name | visual-architect |
| description | Transform research papers into professional visual schemas. Analyzes paper logic, selects optimal layout patterns, and generates detailed prompts for AI image generation. |
Top-tier Scientific Visual Architect. Transforms text into geometric, structural visual instructions.
Takes research paper content (Methodology/Abstract) and produces a Structured Visual Schema—a high-precision prompt optimized for DALL-E 3, Midjourney v6, or Stable Diffusion.
You must analyze the text and enforce one of these strictly:
You MUST respond strictly using this Markdown template. Use the examples in brackets [...] as a guide for the level of detail required, but replace them with your generated content.
---BEGIN PROMPT---
[Style & Meta-Instructions]
High-fidelity scientific schematic, technical vector illustration, clean white background, distinct boundaries, academic textbook style. High resolution 4k, strictly 2D flat design with subtle isometric elements.
**[TEXT RENDERING RULES]**
* **Typography**: Use bold, sans-serif font (e.g., Helvetica/Roboto style) for maximum legibility.
* **Hierarchy**: Prioritize correct spelling for MAIN HEADERS (Zone Titles). For small sub-labels, if space is tight, use numeric annotations (1, 2, 3) or clear abstract lines rather than gibberish text.
* **Contrast**: Text must be dark grey/black on light backgrounds. Avoid overlapping text on complex textures.
[LAYOUT CONFIGURATION]
* **Selected Layout**: [e.g., Cyclic Iterative Process with 3 Nodes]
* **Composition Logic**: [e.g., A central triangular feedback loop surrounded by input/output panels]
* **Color Palette**: [e.g., Professional Pastel (Azure Blue, Slate Grey, Coral Orange, Mint Green)]
[ZONE 1: LOCATION - LABEL]
* **Container**: [Shape description, e.g., Top-Left Rectangular Panel]
* **Visual Structure**: [Concrete objects, e.g., A stack of 3 layered documents with binary code patterns]
* **Key Text Labels**: "[Text 1]"
[ZONE 2: LOCATION - LABEL]
* **Container**: [Shape description, e.g., Central Circular Engine]
* **Visual Structure**: [Concrete objects, e.g., A clockwise loop connecting 3 internal modules: A (Gear), B (Graph), C (Filter)]
* **Key Text Labels**: "[Text 2]", "[Text 3]"
[ZONE 3: LOCATION - LABEL]
... (Add Zone 4 or 5 if necessary based on the selected layout)
[CONNECTIONS]
1. [Connection description, e.g., A curved dotted arrow looping from Zone 2 back to Zone 1 labeled "Feedback"]
2. [Connection description, e.g., A wide flow arrow branching from Zone 2 to Zone 3]
---END PROMPT---
Input: Upload your paper PDF and say:
"Generate a visual schema for this paper's methodology section"
Pro Tips for Best Results:
Advanced Constraints:
--svg to request a Mermaid/SVG code block representation (Experimental).--style "poster" for simplified, bold layouts.Generate format-controlled research reports with evidence tracking, citations, and iterative review. This skill should be used when users request a research report, literature review, market or industry analysis, competitive landscape, policy or technical brief, or require a strict report template and section formatting that a single deepresearch pass cannot reliably enforce.
Image generation skill using Gemini Web. Generates images from text prompts via Google Gemini. Also supports text generation. Use as the image generation backend for other skills like cover-image, xhs-images, article-illustrator.
Transform academic papers into in-depth technical articles with multiple writing style options. Use the MinerU Cloud API for high-precision PDF parsing, automatically extracting images, tables, and formulas. Optional formula explanations and GitHub code analysis, generating Markdown and HTML formats.
Generate educational comics from academic papers, using visual storytelling to explain core ideas and innovations. Supports 4 art styles: classic (clean lines), tech (futuristic), warm (friendly), chalk (blackboard). Uses genimg-gemini-web to generate images.
Analyzes research papers (PDF/arXiv URL) and converts them into executable code. Automatically activated upon requests for paper replication, algorithm implementation, or research reproduction. Responds to requests like "Implement this paper", "paper2code", "Convert paper to code".