
{ "scene": { "location": "Modern city street at night", "background_elements": [ "Illuminated skyscrapers", "Palm trees wrapped in warm fairy lights", "Motion-blurred car headlights and tail lights", "Strong bokeh effect in the background" ], "lighting": "Cinematic soft directional lighting with urban reflections" }, "character": { "description": "Stylish young man with dark hair", "clothing": { "outerwear": "White denim jacket", "innerwear": "Fitted yellow crew-neck t-shirt", "bottoms": "Dark grey slim-fit cargo pants", "footwear": "Clean white sneakers" }, "accessories": [ "Black classic sunglasses", "Luxury silver wristwatch" ], "pose": "Casual leaning against the front of a car with hands in pockets", "expression": "Confident smile" }, "vehicle": { "make_model": "Red BMW M5", "details": "High-gloss metallic red paint, iconic kidney grille", "position": "Parked centrally on an asphalt city road" }, "technical_style": { "quality": "High-resolution photography", "aesthetic": "Professional cinematic fashion shoot", "focus": "Sharp foreground detail with shallow depth of field" } }

MONUMENT: The Forbidden City
Photorealistic hero product photography shot on Sony A7III with 85mm f/1.4 lens at f/2.8, soft natural daylight from upper left creating gentle realistic shadows and visible surface textures: A giant famous [MONUMENT] masterfully recreated as a delicious multi-tiered edible cake. Visible fluffy sponge cake layers, thick silky buttercream frosting, smooth fondant details, subtle edible gold leaf accents, realistic cake crumbs and ganache drips. The cake tiers ingeniously transformed into a luxurious miniature vibrant living city with tiny glowing windows, staircases, gardens and balconies built into the frosting and sponge. Miniature visitors in tiny colorful clothes walk the buttercream pathways, relax on cake terraces, sip tiny drinks and take photos. Intricate realistic cake textures mixed with whimsical architecture, physically plausible yet surreal. Shot for a luxury architecture magazine cover, ultra-detailed 8K, perfectly centered square composition, no text, no artifacts.
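The prompt above is a template: the MONUMENT line supplies the value for the [MONUMENT] placeholder. A minimal sketch of that substitution follows. The names `fill_prompt` and `TEMPLATE` are hypothetical helpers introduced only for illustration, and the template string is an abbreviated copy of the prompt, not the full text.

```python
import re
from typing import Mapping

# Abbreviated copy of the prompt above; "[MONUMENT]" is its placeholder.
TEMPLATE = (
    "A giant famous [MONUMENT] masterfully recreated as a delicious "
    "multi-tiered edible cake."
)


def fill_prompt(template: str, values: Mapping[str, str]) -> str:
    """Replace each [KEY] placeholder with its value.

    Raises ValueError if any upper-case [PLACEHOLDER] is left unfilled.
    """
    filled = template
    for key, value in values.items():
        filled = filled.replace(f"[{key}]", value)
    leftover = re.findall(r"\[[A-Z_]+\]", filled)
    if leftover:
        raise ValueError(f"unfilled placeholders: {leftover}")
    return filled


prompt = fill_prompt(TEMPLATE, {"MONUMENT": "The Forbidden City"})
print(prompt)
```

The substitution is literal, so a value that begins with an article yields "A giant famous The Forbidden City"; passing "Forbidden City" instead may read more naturally in the final prompt.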

Using the uploaded real subject as the visual baseline, create a full-body portrait with an independent, rebellious aesthetic and a vintage film texture. Embrace a 1990s point-and-shoot camera style, using direct flash, high-contrast lighting, retro color grading, and casual composition to convey a raw, authentic, and edgy “cool girl” atmosphere.
Character Setup (Based on the real uploaded subject)
Baseline: Use the real female subject as the foundation, preserving her facial features and body proportions.
Hair: Thick, slightly messy long hair (color matching the uploaded subject).
Outfit: Sporty black cropped top with white trim, paired with matching black shorts and white over-the-knee socks.
Accessories: A silver bracelet worn on the wrist.
Pose and Composition
Pose: Sitting casually on an elevated dark wooden surface (such as a counter or piano top). The body leans forward slightly, relaxed but with a sharp edge. One hand is raised near the mouth, lightly biting a finger. The gaze is directed straight at the camera, with a sensual yet nonchalant expression.
Composition: Full-body shot using a 35mm focal length, slightly low-angle perspective to emphasize leg lines. Medium-distance framing, appearing spontaneous yet visually balanced. Sharp focus on the subject, with the background falling off due to the rapid light decay of direct flash.
Scene and Details
Background: A white wall decorated with vintage posters taped onto it, along with brown wooden blinds.
Foreground and Props: Include visible elements such as the neck of an electric guitar, a tin of cookies, and several bottles to enhance the lived-in, candid atmosphere.
Lighting and Color
Lighting: Simulate direct on-camera flash with hard lighting. The light should be harsh and frontal, casting sharp, well-defined shadows on the wall behind the subject. High contrast, with no soft lighting or diffusion.
Color and Texture: Vintage film aesthetic, emulating Kodak Gold 200 tones. Warm hues from wooden blinds and furniture contrast with the cool white flash. Colors should be deep and slightly muted, with rich blacks and subtle film grain, creating a lo-fi indie visual style.
Overall Mood and Style
Style: Independent, rebellious “cool girl” aesthetic. Raw, authentic, with a Y2K-inspired retro fashion editorial vibe, resembling a snapshot.
Mood: Casual, direct, with a hint of playful defiance.
Technical Specifications and Constraints
Image Quality: Photorealistic, but intentionally incorporating vintage film imperfections and direct-flash artifacts.
Avoid: Soft lighting, studio lighting setups, shallow depth-of-field bokeh effects, polished studio portraits, skin smoothing, 3D-rendered looks, cartoon or illustration styles, distorted hands, missing guitar strings, floating objects, anatomical inaccuracies, stiff posing, excessive post-processing, and HDR effects.

Intelligent professional portrait headshot in the style of American ID photography. The subject size is balanced and natural within the frame. Use a subtle light gray to white gradient studio background. Lighting is soft and natural, highlighting realistic skin tones and gentle depth. The image should be clear and high-quality with sharp facial focus. Skin texture appears clean, luminous, and healthy. The head-to-shoulder proportion is comfortable and natural. The overall mood is modern and elegant. The subject has a relaxed, natural, and confident expression with bright, lively eyes. Medium portrait framing with the subject centered in the composition. Low contrast lighting to create a refined, professional studio portrait aesthetic. Suitable for business and professional profile photos. Aspect ratio: 3:4.

Using the uploaded real female subject as the visual baseline, generate an immersive, story-rich environmental portrait with a warm and comforting atmosphere. The scene is set by a sunlit window inside a secondhand bookstore, capturing a quiet afternoon moment shared between a young woman and a cat. The overall style should be cinematic, highly detailed, and photorealistic, using light, texture, and everyday elements to create a cozy, healing, and narrative-driven visual experience.
Character Baseline and Styling
Baseline: Strictly follow the facial features, structure, and youthful aura of the woman in the uploaded image.
Expression: Calm and relaxed, immersed in the quiet rhythm of reading and rest.
Hair: Long chestnut-brown hair, naturally falling.
Clothing: A cream-colored oversized sweater, large enough to cover her hands, enhancing a sense of softness, comfort, and enclosure.
Scene, Details, and Interaction
Primary Setting: A sunlit secondhand bookstore. The subject is seated on a slightly dusty windowsill.
Core Props and Interaction:
Books: Vintage manga scattered beneath and around her. An open copy of “Nausicaä of the Valley of the Wind” lies beside her.
Cat: A black-and-white tuxedo cat curled up asleep by her legs, with one paw resting gently on the open manga page. Subtle details such as the faint twitching of the cat’s whiskers during sleep should be captured.
Action: The subject is sipping iced coffee from a ceramic cup, with condensation visibly beading on the cup.
Environmental Details: Outside the window is a soft baby-blue sky, with a distant clock tower suggesting midday through lighting cues. The aged yellowed pages of old books, the knit texture of the sweater, and the fine details of the cat’s fur must all be rendered with high clarity.
Lighting, Composition, and Image Quality
Lighting: Bright natural sunlight streams through the old bookstore window, casting patterned, lattice-like shadows across the subject’s face. Light is the dominant visual element, creating a warm, airy atmosphere that emphasizes a “comfortable afternoon” mood.
Composition: Medium-close framing focused on the subject, the cat, and the windowsill area. The perspective should resemble an environmental portrait with strong narrative presence.
Color and Tone: Warm overall color palette, soft lighting, delicate shadows, and naturally balanced saturation, highlighting a cozy and nostalgic feeling.
Image Quality: 8K ultra-high resolution with extreme detail. Must clearly render volumetric light (Tyndall effect), sweater fibers, aged paper textures, cat fur, glass surface imperfections, and fine dust particles suspended in the air.
Mood and Atmosphere
Core Mood: Healing, tranquil, lazy, nostalgic, and rich with literary and everyday life ambiance. The image should convey a sense of time slowing down, with a perfect moment of solitude shared harmoniously between a person, her pet, and her books.

Core Concept
Using the uploaded real female subject as the visual baseline, generate a high-fashion editorial image infused with Y2K retro aesthetics and strong visual contrast. The scene captures the subject in a summer outdoor setting, interacting with a pink vintage convertible covered in decorative foam, creating a lively, nostalgic, and fashion-forward visual narrative.
Character Baseline and Styling
Baseline: Strictly follow the facial features, structure, and personal aura of the subject in the uploaded image.
Expression: Confident with a playful edge.
Skin: Naturally radiant with a healthy glow.
Physique: Well-proportioned and fit, with a natural forward-leaning posture.
Hair: Long, dynamic wavy hair, gently lifted by a light breeze for a sense of motion.
Outfit: Bright pink two-piece casual set paired with matching stylish sandals.
Accessories: Fashion sunglasses, a delicate necklace, and minimal rings.
Pose, Scene, and Lighting
Pose: The subject leans naturally against the car, with both hands lightly touching the foam-covered surface.
Scene: Bright outdoor setting featuring a pink vintage convertible coated in decorative foam as the central prop.
Environment: Slightly wet ground reflecting ambient daylight, creating natural reflective highlights.
Lighting: Bright natural sunlight producing balanced highlights and shadows, enhancing contrast and dimensionality.
Color Palette: Dominant pink tones complemented by blue sky and white foam, forming a harmonious and vibrant color scheme.
Atmosphere: Playful, nostalgic, and stylish.
Composition, Quality, and Style
Composition: Eye-level perspective with a slightly wide-angle feel.
Technical Settings: Aperture at f/4.0 to maintain clarity across the frame.
Image Quality: High-detail photorealism with strong emphasis on material textures and light behavior.
Visual Style: Y2K-inspired retro fashion with cinematic storytelling.

Using the uploaded real female subject as the visual baseline, generate an ultra-high-detail fashion portrait that blends balletcore aesthetics with Rococo-inspired fantasy. The scene is set in a classical interior, capturing an elegant moment of the subject wearing a voluminous pink and gray-green tutu, creating an ethereal, dreamlike, and haute couture visual narrative.
Character Baseline and Styling
* Baseline: Strictly follow the facial features, structure, and overall aura of the woman in the uploaded image. Preserve her distinctive identity and soft characteristics.
* Expression: Soft and ethereal, with large, expressive eyes. Skin texture should be smooth with a natural luminous glow and subtle warm undertones.
* Physique: Slender and elegant body, with realistic skin tones and soft highlights on the legs and upper chest.
* Hair: Voluminous long blonde curls, appearing wind-swept. Hair strands should form a dynamic, airy halo effect around the head and shoulders.
* Outfit: Top: A pale pink structured corset with front lace-up detailing and delicate ribbon-tied shoulder straps. Bottom: An ultra-short, highly voluminous multi-layered ballet tutu composed of gray-green and soft pink tulle layers.
* Style: Romantic balletcore with a touch of Rococo whimsy.
* Footwear: Minimalist white lace-up high heels with a refined ankle strap.
Pose, Environment, and Lighting
* Pose: Standing with the body slightly leaning against a white wall, ankles crossed to create an elegant silhouette. Arms placed behind the back, gently pushing the upper body forward.
* Environment: A refined classical interior featuring white paneled double doors with decorative molding, and a polished light herringbone wood floor.
* Lighting: Bright, diffused natural light entering from a side window, casting soft shadows and highlighting the delicate textures of tulle and lace.
* Color Palette: Soft pastel tones including blush pink, muted gray-green, and clean gallery white.
* Atmosphere: Dreamy, ethereal, with a high-fashion editorial mood.
Composition, Quality, and Style
* Composition: Eye-level full-body cinematic shot using a 50mm f/1.2 lens.
* Technical: Aperture set to f/2.0 to achieve soft focus falloff.
* Image Quality: Ultra-photorealistic, RAW-style rendering at 8K resolution with extreme detail. Precisely capture the transparency and layering of tulle, the satin sheen and texture of the corset, the curl and gloss of the hair, the fine details of skin, and the reflective quality of the wooden floor. Rendering quality should meet standards of volumetric lighting, ray-traced reflections, ultra-realistic textures, and medium-format (Hasselblad-like) photographic fidelity.
* Style: Hyper-realistic, romantic narrative, haute couture fashion photography.

{ "metadata": { "image_type": "photograph", "primary_purpose": "editorial" }, "composition": { "rule_applied": "rule of thirds", "aspect_ratio": "4:5", "layout": "single subject", "focal_points": [ "Subject's face and dark sunglasses", "Yellow fluorescent light fixture leading to the subject" ], "visual_hierarchy": "Eye starts at the subject's face due to contrast with sunglasses, follows the V-neck down the black coat, moves outward along the metallic handrails, and catches the bright sunlight shaft on the left wall and the yellow light on the right.", "balance": "asymmetric" }, "color_profile": { "dominant_colors": [ { "color": "Black", "hex": "#111111", "percentage": "45%", "role": "primary subject" }, { "color": "Warm Gray", "hex": "#7A7269", "percentage": "35%", "role": "background" }, { "color": "Golden Yellow", "hex": "#E5B05C", "percentage": "10%", "role": "accent" } ], "color_palette": "complementary", "temperature": "warm", "saturation": "moderate", "contrast": "high contrast" }, "lighting": { "type": "mixed", "source_count": "multiple sources", "direction": "front", "directionality": "highly directional", "quality": "dramatic", "intensity": "moody", "contrast_ratio": "high contrast (dramatic shadows)", "mood": "mysterious", "shadows": { "type": "harsh defined edges", "density": "deep black", "placement": "under handrails, within stair treads, and engulfing the subject's lower body", "length": "medium" }, "highlights": { "treatment": "blown out", "placement": "on upper left wall from sun shaft, fluorescent light fixture, and reflective handrails" }, "ambient_fill": "present", "light_temperature": "warm (golden)" }, "technical_specs": { "medium": "digital photography", "style": "realistic", "texture": "sharp", "sharpness": "tack sharp", "grain": "digital noise", "depth_of_field": "medium", "perspective": "high angle" }, "artistic_elements": { "genre": "street", "influences": [ "Urban street style fashion photography", "Cinematic cyberpunk/matrix aesthetic" 
], "mood": "sophisticated", "atmosphere": "Gritty yet polished urban transit environment with a moody, imposing, and highly stylized fashion presence", "visual_style": "structured" }, "typography": { "present": false, "fonts": [], "placement": "", "integration": "" }, "subject_analysis": { "primary_subject": "Young male model wearing an oversized, structured long black overcoat, black V-neck top, black trousers, and dark rectangular sunglasses, walking down stairs. Hair: Medium length, textured cut (e.g., modern shag/tapered cut). Length: Roughly ear-level/temple-level on the sides, slightly longer and voluminous on top (3-4 inches), with soft layers and natural movement. Texture: Natural wavy texture, showing varied strands, slight frizz, and natural volume (imperfections visible, not 'AI smooth'), styled loosely back from the forehead without a distinct part. Evidence of light matte product application.", "positioning": "center", "scale": "medium", "interaction": "interacting with environment", "facial_expression": { "mouth": "neutral", "smile_intensity": "no smile", "eyes": "direct gaze", "eyebrows": "neutral", "overall_emotion": "serious", "authenticity": "posed" }, "hands_and_gestures": { "left_hand": "resting at waist level, lightly holding the edge of the black overcoat, silver ring visible on finger", "right_hand": "not visible, hidden behind the coat's bulk or swinging backward", "finger_positions": "left hand thumb and index finger slightly curled and gripping fabric, remaining fingers obscured", "finger_interlacing": "none", "hand_tension": "relaxed", "interaction": "holding coat", "naturalness": "deliberately posed" }, "body_positioning": { "posture": "standing", "angle": "facing camera", "weight_distribution": "shifted", "shoulders": "level" } }, "background": { "setting_type": "indoor", "spatial_depth": "medium", "elements_detailed": [ { "item": "Stainless steel handrail", "position": "left", "distance": "foreground", "size": "medium", "condition": 
"worn", "specific_features": "Tubular metal with distinct mounting brackets and subtle reflections" }, { "item": "Stainless steel handrail", "position": "right", "distance": "midground", "size": "medium", "condition": "worn", "specific_features": "Tubular metal matching the left side, catching edge highlights" }, { "item": "Fluorescent light fixture", "position": "top right", "distance": "background", "size": "medium", "condition": "vintage", "specific_features": "Rectangular housing emitting a strong, warm amber/yellow light" } ], "wall_surface": { "material": "tile", "surface_treatment": "finished", "texture": "perfectly smooth", "finish": "satin", "color": "warm grayish-brown", "color_variation": "patchy", "features": "clean, distinct grid-like grout lines separating large rectangular stone/granite panels, sharp diagonal shaft of warm sunlight hitting the upper left section", "wear_indicators": "industrial" }, "floor_surface": { "material": "concrete", "color": "dark gray with yellow accents", "pattern": "striped" }, "objects_catalog": "Metal diamond-plate stair treads with yellow painted safety edges running horizontally across the bottom and midground, dual tubular stainless steel handrails flanking the stairs, rectangular yellow light fixture mounted to the ceiling angled directly above the stairwell, large rectangular stone tiles covering the walls.", "background_treatment": "sharp" }, "generation_parameters": { "prompts": [ "Editorial street style photography, a handsome young man with medium length, textured modern shag haircut showing natural wavy movement and styled loosely back. He wears dark rectangular sunglasses, a long black oversized wool coat, and an all black outfit, walking down urban subway stairs. The walls are made of large polished brown-gray stone tiles with visible grid grout lines. A dramatic shaft of warm golden sunlight hits the upper left wall. A vintage yellow fluorescent light fixture is on the upper right ceiling. The stairs are concrete with metal edges and yellow safety paint. Stainless steel handrails. Cinematic lighting, deep dramatic black shadows, sharp focus, 85mm lens, high contrast, moody, fashionable underground matrix aesthetic.", "Cinematic fashion portrait, male model in structural black overcoat descending transit stairs, medium length textured hair with soft waves, tinted glasses. High angle shot looking down the steps. Brutalist architecture background, granite wall panels. Mixed lighting featuring harsh geometric sunbeams on the wall contrasting with artificial yellow ambient light. Gritty texture, high dynamic range, hyper-realistic, sophisticated urban mood." ], "keywords": [ "street style", "oversized black coat", "subway stairs", "cinematic lighting", "medium length hair", "textured hair", "modern shag" ], "technical_settings": "85mm lens, f/4.0 aperture for medium depth of field keeping background architectural elements recognizable, 1/250s shutter speed, ISO 400. Metered to preserve the sun shaft highlight while letting shadows fall to deep blacks.", "post_processing": "Color grading to enhance warm golden highlights against neutral/cool shadows, boosted contrast curve, subtle film grain introduced in the deep shadow areas for texture." } }