
{
  "task_description": "Generate a flagship-level full-category brand identity mockup",
  "prompt_structure": {
    "subject": "A massive, ultra-detailed brand ecosystem flat lay",
    "brand_info": {
      "name": "Marvin Studio",
      "industry": "Artisan Bakery & Coffee",
      "visual_theme": "Nordic Forest, Minimalist, Warm"
    },
    "item_categories": {
      "packaging": [
        "shopping bags (S/M/L)",
        "cake boxes",
        "coffee bean pouch",
        "sandwich wedge box",
        "pastry box"
      ],
      "dining_essentials": [
        "coffee cups with sleeves",
        "paper napkins",
        "sugar packets",
        "tray liner paper",
        "menu board"
      ],
      "staff_uniform": [
        "folded green apron",
        "embroidered baseball cap",
        "staff name tag"
      ],
      "merchandise": [
        "canvas tote bag",
        "ceramic mug",
        "branded notebook",
        "ballpoint pen"
      ]
    },
    "visual_style": {
      "art_style": "Clean vector mockup with realistic texture mapping",
      "color_palette": [
        "Forest Green (Dominant)",
        "Matte White (Sub-dominant)",
        "Kraft Paper Brown (Texture)",
        "Beige (Accent)"
      ],
      "logo_style": "Minimalist line-art deer"
    },
    "composition": {
      "type": "Dense Knolling",
      "background": "Solid dark charcoal (#1A1A1A)",
      "perspective": "90-degree Top-down (Orthographic)"
    }
  },
  "negative_prompt": "messy, cluttered, overlapping, 3D perspective distortion, photorealistic reflection, glossy plastic, neon colors, human hands, watermark, low resolution",
  "generation_params": {
    "aspect_ratio": "3:4",
    "detail_level": "High",
    "texture_quality": "Premium"
  }
}
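Some image tools accept only a plain text prompt rather than structured JSON. The sketch below shows one possible way to flatten a spec like the one above into a prompt string and a negative prompt. It is only an illustration: `flatten_spec` is a hypothetical helper, the embedded spec is a trimmed copy, and the wording of the joined sentence is an assumption, not a format required by any particular generator.

```python
import json

# Trimmed copy of the spec above; only the keys used below are kept.
SPEC = json.loads("""
{
  "prompt_structure": {
    "subject": "A massive, ultra-detailed brand ecosystem flat lay",
    "brand_info": {"name": "Marvin Studio", "industry": "Artisan Bakery & Coffee"},
    "item_categories": {
      "packaging": ["cake boxes", "coffee bean pouch"],
      "merchandise": ["canvas tote bag", "ceramic mug"]
    }
  },
  "negative_prompt": "messy, cluttered, overlapping",
  "generation_params": {"aspect_ratio": "3:4"}
}
""")


def flatten_spec(spec):
    """Hypothetical helper: join the spec into (prompt, negative_prompt)."""
    body = spec["prompt_structure"]
    brand = body["brand_info"]
    # Collect every item from every category, in the order the spec lists them.
    items = [item for group in body["item_categories"].values() for item in group]
    prompt = (
        f'{body["subject"]} for {brand["name"]} ({brand["industry"]}). '
        f'Items: {", ".join(items)}. '
        f'Aspect ratio {spec["generation_params"]["aspect_ratio"]}.'
    )
    return prompt, spec["negative_prompt"]


prompt, negative = flatten_spec(SPEC)
print(prompt)
print("Negative:", negative)
```

Keeping the spec as JSON and flattening it at the last step means item lists and palette entries can be edited without rewriting the sentence by hand.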

A hyper-realistic 8K close-up portrait of a person's head and upper neck, shot from a slightly low-angle perspective looking upward. Use the uploaded image as the face reference: the face must match it exactly (same identity, facial structure, proportions, skin details, and expression). Do not alter the face in any way. The subject is wearing bright yellow sunglasses with reflective lenses showing colorful abstract digital scenes in shades of pink, blue, and yellow. The face is rendered in detailed grayscale, showing realistic skin texture, pores, and light stubble on the jawline, creating strong contrast with the rest of the head. The hair and most of the head and neck are made of glowing digital circuit patterns, abstract shapes, lines, and data streams in vibrant colors like magenta, cyan, blue, green, yellow, and orange. These elements appear layered and complex, with a soft internal glow. Parts of the digital head are breaking apart and dissolving outward into pixels, lines, and glitch fragments that fade into a clean white background, giving a glitch-art, futuristic look. Cinematic lighting highlights one side of the face, with shadows under the chin and subtle rim lighting around the digital elements. Overall style is futuristic, cyber-inspired, high-detail, and photorealistic.

Create an ultra-wide infographic poster titled “Kyoto Iconic Travel Classics Atlas”, with a 21:9 horizontal layout in true 8K resolution. The visual language blends ceramic glaze and jade-like softness, expressing calm luxury, precision, and cultural depth. The overall mood is restrained, warm, and translucent, evoking the feeling of a museum-grade exhibition combined with refined cultural branding. All on-image text is in English.
Background and overall material tone
* Background features a light rice-white to misty off-white gradient, similar to a high-end ceramic exhibition space
* Global lighting is gentle and diffused: soft HDRI illumination with subtle rim light, avoiding harsh contrast or strong shadows
* Color palette centers on jade white, pale celadon, soft moss green, warm beige, with minimal accents of vermilion or muted gold (no more than 3–4 dominant hues across the entire image)
Layout: eight equal-width “ceramic/jade plaques” (strictly eight columns)
* The central composition uses a strict eight-column grid, each column designed as a ceramic-style information plaque
* Plaque material resembles fine white porcelain glaze: rounded corners, smooth highlights, subtle reflective sheen, clean edges
* Borders may include an extremely delicate gold trace or jade-edge effect, applied with restraint
* Shadows are soft and shallow, as if the plaques are placed on a curated museum display surface
Core visuals: realistic 3D landmarks (PBR) with a serene glazed aesthetic
* Each column top features one real, iconic Kyoto landmark (automatically select eight authentic sites, no fictional entries)
* Use realistic 3D environment rendering with physically based materials, while maintaining a calm, jade-like visual softness
* Materials must follow real-world physical rules: wood, stone, tile, water, foliage, and paper surfaces rendered with accurate PBR properties
* Unified color grading: soft highlights, moderate contrast, low saturation, and gentle warmth inspired by ceramic glaze and polished jade
* Composition emphasizes depth and quiet grandeur through layered space and subtle scale references (small figures, lanterns, trees)
* Water elements (rivers, ponds, moats) should display delicate reflections and smooth tonal transitions
* Avoid cartoon styles, low-poly geometry, plastic textures, excessive sharpness, or dramatic cinematic lighting
Detail view: glazed “micro dish” or “jade charm” window
* Each column includes one small detail window in the lower-right corner, using a single consistent shape across all columns
* Option A: a small circular dish resembling a porcelain saucer, showcasing fine details (wood grain, roof tiles, stone steps, garden gravel, water ripples)
* Option B: a jade-charm-shaped window with rounded contours, highlighting close-up textures and materials
* Surfaces should exhibit soft glaze or jade-like highlights with smooth, rounded edges
* A thin, elegant leader line connects the detail window to the main 3D landmark, accompanied by a concise 2–6 word label
Text per column (concise, refined, understated)
* Landmark name (4–8 English words, refined serif typeface recommended)
* Two short informational lines:
  * Highlight: maximum 8 words
  * Best for: maximum 6 words
* Text color uses ink gray or bluish charcoal; accent colors appear only in small dots or separators
Top title area (museum exhibit label style)
* Main title: Kyoto Iconic Travel Classics Atlas
* Subtitle: one refined positioning sentence, maximum 18 words, calm and poetic in tone
* Top-right corner includes five small glazed color dots: pale celadon, jade white, moon white, warm beige, with a subtle vermilion accent
Bottom information band (exhibition caption style)
* A full-width bottom band divided into four sections by fine lines, resembling museum display captions
* Kyoto at a glance: one sentence, maximum 22 words
* Visiting tips: three bullet points, each no more than 12 words
* Cultural trivia: three items, each no more than 18 words
* Timeline: 3–6 key nodes (era or year plus keyword), connected with minimal linear icons using thin, consistent strokes
Automatic landmark selection rules
* Cover at least five categories: natural scenery, historical temples or shrines, traditional architecture, museums or cultural institutions, classic streets or seasonal night scenes, and intangible cultural experiences (tea ceremony, festivals, crafts)
* If necessary, fill gaps with safe, general entries such as “Kyoto National Museum” or “Historic Gion District”
* All landmark names must be real and verifiable; if uncertain, use conservative generic naming and never fabricate locations
Output constraints
* English text only
* Strong alignment, minimalist spacing, generous white space
* No logos, no branding marks, no watermarks
* True 8K ultra-high resolution, suitable for cultural tourism displays and horizontal social media covers
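If the on-image copy is drafted by hand before generation, the word limits in the prompt above can be checked mechanically. The following is a minimal sketch, assuming words are counted by whitespace splitting; the field names in `LIMITS`, the helper `over_limit`, and the draft strings are all hypothetical and serve only as illustration.

```python
# Word limits copied from the prompt above; the field names are hypothetical labels.
LIMITS = {
    "subtitle": 18,
    "highlight": 8,
    "best_for": 6,
    "at_a_glance": 22,
    "visiting_tip": 12,
    "cultural_trivia": 18,
}


def over_limit(field, text):
    """Return how many words `text` exceeds the limit for `field` (0 if within it)."""
    return max(0, len(text.split()) - LIMITS[field])


# Made-up draft copy, used only to exercise the check.
draft = {
    "highlight": "Thousands of vermilion torii gates climbing a forested hillside",
    "best_for": "Early morning walks",
}

for field, text in draft.items():
    print(field, over_limit(field, text))
```

Here the sample highlight runs to nine words, one over its limit of eight, while the "best for" line stays within its limit of six.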

Ultra-realistic luxury fashion editorial featuring the woman from the reference photo walking toward the camera mid-step, natural arm swing, focused eyes, candid paparazzi energy. Scene: underground parking garage, strip lighting, long depth. Outfit: cream cropped blouse, dark denim jeans, black stilettos. Accessories: small quilted handbag, minimal jewelry. Mood: real, powerful, celebrity presence. Style: street fashion + Vogue hybrid. Image Quality: 8K, balanced sharpness, subtle film grain. Lighting: ambient + soft key light. Color Grading: neutral cinematic palette. Camera: DSLR, 85mm, f/1.8. Parameters: 9:16, steps 50, CFG 8.0. Negative: staged pose, artificial glow.

Style & Subject: A high-end minimalist vector illustration of the [Barcelona] skyline, featuring the distinct silhouettes of [Sagrada Família, Torre Glòries, Montjuïc Castle, Barcelona Cathedral]. The architectural forms are deconstructed into sharp geometric facets and clean vertical layers, emphasizing a sense of rational order and modern rhythmic beauty. Artistic Elements: Integrate flowing, translucent abstract ribbon-like curves in the background that weave gracefully between the buildings. These curves should be thin and elegant, providing a dynamic contrast to the solid, static buildings. No messy details; focus on pure form and "breathing" negative space. Color Palette: A sophisticated, curated palette of muted teals, soft corals, warm ochre, and deep midnight blue. Use subtle tonal gradients and overlapping translucent layers to create depth and volume without using realistic shadows or 3D textures. Typography & Layout: Centered at the very bottom of the canvas, place the text "[Barcelona]" in an elegant uppercase sans-serif font. The text must be situated in a clean, generous white margin at the bottom 15% of the image, completely isolated from the architectural graphics. --ar 4:5
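The square-bracketed values in the prompt above act as slots that can be swapped to reuse it for another city. A minimal sketch of that substitution follows, assuming each slot is matched by its exact bracketed text; `list_slots` and `swap_slots` are hypothetical helpers, the excerpt is a condensed stand-in for the full prompt, and the Lisbon values are example inputs only.

```python
import re

# Condensed stand-in for the prompt above; the bracketed slots are kept as written.
excerpt = (
    "Skyline of [Barcelona], featuring the distinct silhouettes of "
    "[Sagrada Família, Torre Glòries, Montjuïc Castle, Barcelona Cathedral]. "
    'Place the text "[Barcelona]" at the bottom.'
)


def list_slots(prompt):
    """Return the contents of every [bracketed] slot, in order of appearance."""
    return re.findall(r"\[([^\]]+)\]", prompt)


def swap_slots(prompt, replacements):
    """Replace whole slots only, so 'Barcelona Cathedral' inside a list is untouched
    unless the full list slot is replaced as well."""
    for old, new in replacements.items():
        prompt = prompt.replace(f"[{old}]", f"[{new}]")
    return prompt


swapped = swap_slots(
    excerpt,
    {
        "Barcelona": "Lisbon",
        "Sagrada Família, Torre Glòries, Montjuïc Castle, Barcelona Cathedral":
            "Belém Tower, Jerónimos Monastery, São Jorge Castle, Lisbon Cathedral",
    },
)
print(list_slots(excerpt))
print(swapped)
```

Matching on the full bracketed text keeps the city name and the landmark list in sync: both occurrences of the city slot change together, and the landmark list is replaced as one unit.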

[INPUT IMAGE: USER_PHOTO] Use the person in the input image as the ONLY subject. Preserve their identity and facial features clearly. Create a hyper-realistic high-fashion editorial photo inside a surreal 3D geometric “color box” room (a hollow cube / tilted cube set). Each render MUST randomly choose: 1) a bold single-color box (monochrome environment, vivid and saturated), 2) a dynamic “cool” fashion pose (gravity-defying or extreme stretch / leap / sideways bracing against the walls), 3) a dramatic camera angle (wide-angle 24–35mm equivalent, tilted horizon, strong perspective). The subject appears full-body and sharp, wearing avant-garde fashion styling that feels modern and editorial (clean silhouette, stylish layering, premium fabric texture). Keep clothing tasteful and fashion-forward. The subject’s pose should feel athletic, stylish, and unusual, like a magazine campaign shot. Lighting: studio quality, crisp and cinematic; strong key light with controlled soft shadows, subtle rim light; realistic reflections and bounce light from the colored walls. Ultra-detailed skin texture, natural pores, realistic fabric weave, clean edges, high dynamic range. Composition: subject centered with plenty of negative space and strong geometric lines; the box perspective frames the subject. Color: the box color is a SINGLE bold color and MUST be different each run (random vivid hue). The subject’s outfit contrasts well with the box color. Output: hyper-real, photorealistic, 8k detail, editorial campaign quality, sharp focus on subject, no motion blur, no distortion of face, natural proportions.

Create a miniature 3D isometric diorama showing the invention of [Light Bulb] at the moment of [Edison successfully testing the first long-lasting carbon filament bulb]. Camera angle around 40° from above. Textures feel soft and polished. Materials follow realistic PBR rules. Lighting feels natural and balanced. The raised base includes tools, workshop elements, notes, and early prototypes. Tiny stylized inventors interact with objects. Faces are visible and recognizable with clean shapes and expressions. Background stays solid [warm dark navy blue]. Top center text shows [Light Bulb] in bold. Second line shows [Thomas Edison — 1879]. A simple line icon of the invention sits below. Text color adapts to background contrast.

A high-fidelity, wide-angle interior shot captures a surreal mixed-media composition inside a modern living room. The main human subject uses the uploaded reference face as a strict and unchangeable identity anchor. The uploaded reference face must be preserved with extreme fidelity and accuracy. Facial structure, proportions, bone anatomy, eye shape, nose, lips, skin texture, and overall likeness must closely and clearly match the reference image. Do not alter, stylize, exaggerate, beautify, replace, or reinterpret the human face in any way. The human subject must remain fully photorealistic at all times, even when surrounded by cartoon characters. The subject wears realistic clothing whose colors are carefully chosen to harmonize with the Winnie the Pooh color palette, including warm honey yellows, soft reds, and gentle pastel tones. Clothing materials remain realistic, with natural fabric texture, while the color scheme visually blends into the cartoon world. The subject sits centrally on a plush light gray sofa. The environment blends a photorealistic 3D living room with soft, hand-drawn 2D storybook-style Winnie the Pooh characters interacting naturally with the physical space. On the right side of the sofa, Winnie the Pooh sits happily holding a honey pot. On the left side, Piglet sits shyly on a cushion. Behind the sofa, two framed posters hang on the white wall: one showing Pooh and friends in the Hundred Acre Wood, the other featuring Tigger mid-bounce. In the foreground, Tigger relaxes playfully on a textured gray carpet while Eeyore rests calmly nearby. A small plate with honey-themed treats sits on the floor. A tiny chibi-style Pooh stands proudly on a cream-colored knit pouf. Near the wooden coffee table, Rabbit observes thoughtfully, while Owl perches near a snake plant by the window. Soft natural light enters from the left through sheer curtains, with subtle volumetric lighting enhancing fabric textures, wood grain, and foliage. 
Rendered in 8K resolution, sharp focus, warm pastel colors, dreamy and wholesome cinematic atmosphere. Seamless blend of real world and cartoon world, with no visible AI artifacts. Negative prompt: no face distortion, no face replacement, no stylized human face, no cartoon human, no anime human, no altered identity

Ultra-realistic cinematic food photography of freshly steamed dumplings (momos) on a rustic wooden board. One scene shows finished dumplings with glossy dough, delicate pleats, light oil sheen, and steam rising. Another scene shows an exploded view: dumpling wrappers and spiced minced filling floating mid-air, flour particles frozen in motion, herbs scattered, dramatic slow-motion effect. Warm moody lighting, shallow depth of field, dark rustic kitchen background, professional studio food styling, hyper-detailed textures, cinematic realism, 8K, macro photography, high contrast.