
Reference Images:
{ "image_generation_prompt": { "subject_description": "A person, whose facial features match the uploaded reference image, wearing high-quality winter alpine clothing (insulated jacket, gloves, backpack).", "action": "In the act of stepping out of an open modern cable car gondola onto a snow-covered mountain station deck.", "environment": "A rugged high-altitude alpine station surrounded by panoramic snow-capped mountain peaks under a clear sky. The station architecture is a mix of wood and metal covered in fresh snow.", "specific_details": "The words 'WELCOME 2026' are distinctly carved into a weathered wooden wall section of the station building next to the exit. Incredible detail in snow crystals and fabric textures.", "lighting_and_style": "Hyper-realistic 8K cinematic photography taken with a 35mm lens. Crisp, bright, natural mountain daylight creating sharp clarity and high contrast. Slight film grain." }, "negative_prompt": "cartoon, 3d render, illustration, painting, low quality, blurry, out of focus, distorted face, bad anatomy, extra limbs, summer, green grass, warm weather, studio lighting, flat lighting", "aspect_ratio": "3:4", "use_face_reference": true }

Using the uploaded real female subject as the visual baseline, generate an immersive, story-rich environmental portrait with a warm and comforting atmosphere. The scene is set by a sunlit window inside a secondhand bookstore, capturing a quiet afternoon moment shared between a young woman and a cat. The overall style should be cinematic, highly detailed, and photorealistic, using light, texture, and everyday elements to create a cozy, healing, and narrative-driven visual experience.
Character Baseline and Styling
Baseline: Strictly follow the facial features, structure, and youthful aura of the woman in the uploaded image.
Expression: Calm and relaxed, immersed in the quiet rhythm of reading and rest.
Hair: Long chestnut-brown hair, naturally falling.
Clothing: A cream-colored oversized sweater, large enough to cover her hands, enhancing a sense of softness, comfort, and enclosure.
Scene, Details, and Interaction
Primary Setting: A sunlit secondhand bookstore. The subject is seated on a slightly dusty windowsill.
Core Props and Interaction:
Books: Vintage manga scattered beneath and around her. An open copy of “Nausicaä of the Valley of the Wind” lies beside her.
Cat: A black-and-white tuxedo cat curled up asleep by her legs, with one paw resting gently on the open manga page. Subtle details such as the faint twitching of the cat’s whiskers during sleep should be captured.
Action: The subject is sipping iced coffee from a ceramic cup, with visible condensation beading on the cup.
Environmental Details: Outside the window is a soft baby-blue sky, with a distant clock tower and lighting cues suggesting early afternoon. The aged yellowed pages of old books, the knit texture of the sweater, and the fine details of the cat’s fur must all be rendered with high clarity.
Lighting, Composition, and Image Quality
Lighting: Bright natural sunlight streams through the old bookstore window, casting patterned, lattice-like shadows across the subject’s face. Light is the dominant visual element, creating a warm, airy atmosphere that emphasizes a “comfortable afternoon” mood.
Composition: Medium-close framing focused on the subject, the cat, and the windowsill area. The perspective should resemble an environmental portrait with strong narrative presence.
Color and Tone: Warm overall color palette, soft lighting, delicate shadows, and naturally balanced saturation, highlighting a cozy and nostalgic feeling.
Image Quality: 8K ultra-high resolution with extreme detail. Must clearly render volumetric light (Tyndall effect), sweater fibers, aged paper textures, cat fur, glass surface imperfections, and fine dust particles suspended in the air.
Mood and Atmosphere
Core Mood: Healing, tranquil, lazy, nostalgic, and rich with literary and everyday life ambiance. The image should convey a sense of time slowing down, with a perfect moment of solitude shared harmoniously between a person, her pet, and her books.

Style Direction: Ultra-photorealistic cinematic lifestyle photography, blending Japanese fresh minimalism with Korean tonal aesthetics. Emphasize a sense of relaxation and emotional softness, with a clean, gentle visual style that combines fashion editorial storytelling with everyday warmth.
Theme and Atmosphere: A quiet, lazy afternoon by the window, conveying calmness, softness, and intimacy. Use light and body language to express a “healing daily life” mood.
Reference Baseline: Strictly use the uploaded subject as the sole identity reference. Accurately reproduce facial structure, proportions, skin tone, hairstyle, and all recognizable features. No stylization or alteration of the real face.
Aspect Ratio and Framing: 4:5 vertical composition. Full-body or half-body framing depending on the reference, prioritizing upper body and leg interaction. Balanced visual weight across the frame.
Subject Placement: The subject leans by the window in a relaxed posture, occupying the right two-thirds of the frame. The left side is left open to show a softly blurred outdoor street scene, creating an emotional contrast between interior and exterior.
Interaction Design: Relaxed posture, hands naturally overlapped and resting on the legs. Back gently leaning against the wall, legs slightly bent, conveying a casual, lived-in interaction.
Real Subject (Photorealistic Rendering)
Facial Fidelity: 100% preservation of original facial structure, proportions, skin tone, and hair color. Skin texture must be realistic, with visible pores and natural light transitions, avoiding any artificial smoothing. Eyes appear soft and slightly unfocused, with a faint smile conveying relaxed ease.
Expression: Calm, slightly distant, with a soft, dreamy gaze and naturally relaxed lips, expressing a gentle and healing presence.
Pose: Based on the reference sitting pose: back lightly against the wall, legs slightly bent and overlapped, hands resting naturally on the legs, shoulders fully relaxed with no stiffness.
Wardrobe Reconstruction:
Inner Layer: White spaghetti-strap tank top with ribbed texture, closely fitting the skin with visible material detail.
Outer Layer: Blue-and-white striped shirt, loose fit, sleeves slightly rolled, natural fabric folds, evenly spaced stripes.
Bottom: Likely shorts or a short skirt (frame limited to legs), maintaining a casual homewear aesthetic.
Material Rendering: Skin-contact softness of the tank top, cotton texture of the shirt, and gravity-driven folds must all be clearly visible and realistic.
Lighting and Environment
Lighting Setup: Side-back natural light simulating sunlight from the window on the left. Light grazes the face, shoulders, and clothing, creating soft highlights, while shadows transition naturally (e.g., neck, inner arms), enhancing depth and atmosphere.
Background:
Interior: Light gray matte wall paired with an off-white cushion (soft, voluminous, with natural folds under light).
Exterior: Blurred street scene outside the window, with faint silhouettes of buildings and pedestrians in cooler tones, contrasting with the warm interior light.
Environmental Details: Dark window frame (metal or wood texture), clean windowsill, realistic cushion volume and folds, minimal clutter to maintain a clean, minimalist home aesthetic.
Output Style
Image Quality: 8K resolution with RAW-level detail. Hair strands, skin pores, fabric textures, and light gradients must be rendered with extreme realism.
Color Grading: Low-saturation soft tones dominated by off-white, light gray, and pale blue, with warm yellow highlights simulating afternoon sunlight. Overall palette should feel fresh, calm, and healing.
Post-processing Look: Cinematic grading with preserved shadow detail, soft highlights, and moderate contrast to reinforce the relaxed emotional tone.
Lens Language: 35mm prime lens, f/2.8 aperture, with slight background blur and natural depth transition, keeping the subject as the visual focus.

Using the uploaded real subject as the visual baseline, create an ultra-high-detail, photorealistic portrait with a soft, atmospheric mood. Aim for a warm vintage film aesthetic, shallow depth of field, and cinematic color grading in 4K resolution, evoking a relaxed, introspective, and slightly melancholic intimate atmosphere.
Character Setup
Baseline: Strictly base the subject on the uploaded real female subject, preserving her facial features and proportions. The body should appear slender with natural curves.
Hair and Makeup: Long hair with a slightly tousled look. Makeup is natural and minimal, with soft, well-defined brows. Fingernails are painted in a dark color, and a small silver ring is worn on the left hand.
Expression: Looking directly at the camera with a gentle, subtle smile.
Styling and Pose
Outfit: An oversized, thick chunky knit sweater with rich texture, blending pink, dark gray, and white yarns, worn as a short dress. Paired with thick light gray ribbed over-the-knee knitted socks.
Pose: Sitting on a lightly patterned gray granite kitchen countertop. Knees are drawn tightly to the chest, with the right arm wrapped around the legs. The left hand holds a simple pink coffee cup near the shoulder.
Scene and Lighting
Environment: A cozy kitchen in soft daytime lighting. The background is blurred but recognizable, including white cabinets with round silver handles, a stainless steel stove with a range hood and oven, a large multi-pane window, and hanging cookware in the upper left. On the countertop behind her are a simple black cup, a red electric kettle, and a transparent glass bottle.
Lighting: Soft, diffused natural daylight enters from behind and from the left-side window, casting gentle highlights on her face and the knit texture of the clothing.
Color and Atmosphere
Color Palette: Emphasize charcoal gray, pink, warm cream tones, black, and soft red, creating a warm, slightly faded vintage film aesthetic.
Mood: Relaxed, introspective, with a subtle melancholic tone.
Photography and Image Quality
Composition: 3:4 vertical format, medium-to-wide full-body shot, eye-level perspective. Simulate a modern smartphone camera with shallow depth of field.
Focus and Quality: Sharp focus on the subject’s face and eyes, with cinematic color grading and realistic detail rendering at 4K resolution.

Using the uploaded real female subject as the visual baseline, generate an ultra-high-precision photorealistic portrait that blends fantasy storytelling with high-fashion aesthetics. The character is reimagined as a modern Cupid, poised elegantly among the clouds, creating a visual spectacle that is ethereal, dramatic, and fashion-forward.
Character Baseline and Styling
Baseline: Strictly follow the facial contours, proportions, skin texture, and overall aura from the reference image. Age and ethnic characteristics must remain consistent with the original.
Makeup and Expression: Enhanced soft-glow makeup that remains faithful to her natural features. Expression is focused, with sharp, intent eyes suggesting precise aim.
Hairstyle: Fully replicate the hair color, length, texture, and natural form from the reference image.
Wardrobe, Accessories, and Pose
Main Dress: A fantasy-inspired haute couture gown.
Design: Structured, intricately detailed strapless bodice with a long, flowing train.
Material: Constructed from layered, realistic soft pink cherry blossom petals, creating a light, semi-transparent, airy effect with clearly defined petal textures.
Color: Dominated by soft sakura pink, blended with crimson and nude pink in a natural gradient.
Wings: Large petal wings composed of countless cherry blossom petals arranged in a dimensional collage, transitioning naturally from pale pink to near-white. Each petal must exhibit realistic texture and light transmission.
Footwear: Ultra-thin high-heeled decorative sandals, wrapped with vines and petals, accented with white feathers.
Weapon Accessory: Holding an ornate gilded longbow with intricate engravings. The arrow tip is set with a heart-shaped gemstone, emitting a soft pink-gold glow.
Pose and Composition
Pose: The subject sits gracefully atop a fluffy cloud. The right leg is bent for support, while the left leg extends forward. The upper body leans slightly forward as she draws the bow, combining elegance with controlled strength.
Composition: Medium shot with the subject centered in the frame, wings fully extended. Eye-level perspective. Sharp focus on the face and bow, with the surrounding clouds slightly softened.
Environment and Lighting
Environment: Suspended in a bright sky, seated on voluminous clouds. The background features a vivid blue sky with scattered floating cherry blossom petals, creating a dreamy, weightless atmosphere.
Lighting: Soft directional diffused lighting. The key light from above at an angle defines the silhouette, complemented by fill light to maintain clean shadows. Light creates crystalline highlights on gemstones, metal surfaces, and petals, resulting in an airy yet dimensional tonal quality.
Style and Texture
Visual: Photography-grade hyperrealism with cinematic impact and extreme detail fidelity.
Detail: Supports 8K-level rendering. All materials (skin, petals, metal) must achieve high physical realism.
Fusion: Seamlessly combines high-fashion aesthetics, mythological drama, and romantic fantasy imagination.
Quality: Achieves the clarity, color depth, and visual impact of top-tier commercial fashion campaigns.

{ "meta": { "title": "Heroic Dramatic Studio Portrait", "role": "World-class photographer specializing in editorial portraits", "aesthetic": "Dramatic, saturated studio lighting with a heroic feel" }, "constraints": { "identity_anchor": { "source": "ATTACHED REFERENCE PHOTO", "strictness": "Critical", "instruction": "Perfectly preserve exact facial features, skin tone, hairstyle, and natural likeness without changes." }, "clothing_anchor": { "source": "EXACTLY AS IN REFERENCE PHOTO", "strictness": "Critical", "instruction": "Style, color, material, and fit must fully correspond to the original without stylization." } }, "subject_details": { "expression": "Serious, tense, focused", "gaze": "Directed away from camera, into space above", "universality": "Lighting and angle applicable regardless of gender" }, "composition": { "background": "Saturated solid orange-red backdrop, smooth intense gradients, no patterns, 'hot' atmosphere", "camera_angle": "Low angle (shot from below upwards) to create dominance", "framing": "Medium close-up, emphasis on face and shoulders" }, "lighting": { "palette": "Dominant bright oranges and deep red shades", "key_light": "Strong directional source, deep dramatic shadows (chiaroscuro), emphasizing facial structure", "backlight": "Powerful expressive rim light or color halo separating subject from background", "mood": "Mysterious, tense, high-contrast" }, "technical_specs": { "style": "Photorealism, high detail", "focus": "Sharp focus on face vs smooth gradient background", "texture_quality": "Preserve natural pores and skin texture" }, "combined_prompt_text": "World-class editorial portrait, dramatic studio lighting, heroic feel. [REFERENCE PHOTO IDENTITY PRESERVED]: Exact facial features, skin tone, hairstyle. [CLOTHING EXACT MATCH]: No changes to fit or material. Expression: serious, tense, focused, looking up and away. Background: Saturated solid orange-red, smooth intense gradients, hot atmosphere. Camera Angle: Low angle shot from below for dominance. Framing: Medium close-up. Lighting: Bright orange and deep red palette, strong directional chiaroscuro key light, powerful rim light separating subject from background. Technical: Photorealistic, high detail, sharp focus on face, natural skin texture and pores." }
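The JSON prompt above carries its settings twice: once as structured fields and once as a flattened "combined_prompt_text" string. As a rough sketch of how such a string could be derived from the fields so the two stay in sync, the Python below flattens selected sections into labeled sentences. The helper name flatten_prompt, the choice of sections, and the "Label: value." output format are all assumptions made for illustration; no image model is known to require this layout.

```python
import json

# Hypothetical helper (not part of any image-generation API): turns the
# structured sections of a JSON prompt into one prose string.
def flatten_prompt(spec: dict) -> str:
    parts = []
    # Section names mirror the JSON prompt above; missing sections are skipped.
    for section in ("subject_details", "composition", "lighting", "technical_specs"):
        for key, value in spec.get(section, {}).items():
            label = key.replace("_", " ").capitalize()
            parts.append(f"{label}: {value}.")
    return " ".join(parts)

# A reduced copy of the prompt, kept short for the example.
spec = json.loads("""
{
  "composition": {"camera_angle": "Low angle", "framing": "Medium close-up"},
  "lighting": {"mood": "Mysterious, tense, high-contrast"}
}
""")

combined = flatten_prompt(spec)
print(combined)
```

Generating the combined text from the fields, rather than writing it by hand, avoids the two copies drifting apart when one of them is edited.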

{ "image_type": "Photograph, Mirror Selfie", "shot": "Medium Shot", "shot_details": "Handheld mirror reflection, slightly tilted angle", "style": "Modern Lifestyle, Social Media Aesthetic", "quality": "High-fidelity, mobile photography style", "color_grade": "Warm indoor tones, saturated burgundy", "meta": { "aspect_ratio": "9:16", "resolution": "4k" }, "camera": { "device": "iPhone 17 Pro", "lens": "24mm f/1.8 Wide Angle", "aperture": "f/1.8", "distance": "Arm's length", "angle": "Eye level, slightly tilted", "framing": "Upper body and torso", "pov": "Subject looking into a mirror", "focus": "Sharp on face and eyes", "lens_effect": "Natural mobile sensor depth of field" }, "lighting": { "description": "Warm indoor vanity lighting from a bathroom mirror", "type": "Soft diffused light", "source": "Overhead and side vanity bulbs", "primary": "Warm frontal glow", "secondary": "Ambient indoor light", "highlights": "Soft specular highlights on the shoulders and hair", "shadows": "Soft shadows under the jaw and dress folds" }, "scene": { "location": "Upscale Bathroom or Dressing Room", "environment": "Interior, marble-textured walls, white bathrobes hanging in the background", "time": "Evening", "atmosphere": "Casual, playful, intimate" }, "subject": { "gender": "Female", "age": "Early 20s", "ethnicity": "Caucasian", "appearance": "Glamorous, fit, light tan", "body": { "type": "Slim, athletic", "skin": "Smooth, radiant" }, "expression": { "eyes": "Almond-shaped, focused on the phone screen", "gaze": "Indirect via mirror", "face_vibe": "Playful, cheeky, tongue sticking out" }, "hair": { "color": "Honey blonde with darker roots and highlights", "style": "Long, voluminous, loose waves" }, "pose": { "description": "Leaning slightly forward, one arm holding the phone, other arm supporting the lean" } }, "wardrobe": "Elegant maroon bodycon cocktail dress", "outfit_details": { "top": { "type": "Sweetheart neckline with spaghetti straps", "color": "Deep Burgundy / Maroon" }, 
"bottom": { "type": "Bodycon fit", "color": "Maroon" } }, "accessories": { "jewelry": [ "Gold hoop earrings", "Gold chain with a small cross pendant", "Stacked gold bracelets" ], "phone_case": "Tortoise-shell pattern" }, "details": { "skin_effects": [ "Subtle pores", "Dewy finish" ], "realism": "High, organic skin texture, natural highlights on the tongue" }, "negatives": "Low resolution, blurry, distorted face, messy background, oversaturated skin, plastic texture" }

[Core Concept] Based on the uploaded subject and outfit reference, generate an ultra-photorealistic outdoor hot spring portrait. The theme is “Goddess in the Spring,” capturing a serene moment of the subject immersed in a crystal-clear turquoise hot spring, creating a dreamy, warm, and intimate healing atmosphere. Strictly preserve the subject’s facial features, structure, and body proportions, and accurately reproduce all outfit details.
[Subject and Pose] The subject stands in waist-deep hot spring water, body angled in a 3/4 orientation toward the camera, forming a subtle S-curve through the torso. Both arms extend outward to the sides, fingertips lightly touching the water surface, creating gentle ripples. The head tilts upward approximately 15 degrees, looking directly at the camera with a calm, soft expression and slightly parted lips. Long hair (matching the uploaded subject’s color and style) flows behind the shoulders, with the ends floating on the water surface. A large olive-green flower (peony or lotus) is placed behind the left ear. The gaze is warm and inviting, shoulders relaxed and slightly drawn back.
[Wardrobe Styling] Strictly follow the uploaded outfit reference: a deep olive-green one-piece swimsuit with all-over letter prints. Details include: halter neckline, crisscross lace-up detail at the chest, side cut-outs at the waist, and a repeating tonal letter pattern across the fabric. The color is deep olive green with matching tonal typography.
[Composition and Depth of Field] Use a 4:5 vertical frame with a medium-close composition (waist-up). The camera is positioned slightly above the hot spring edge, angled downward at 20–30 degrees. The image is structured in three layers:
* Foreground: shimmering water surface with caustic light patterns projected onto the skin and swimsuit.
* Midground: sharply focused subject.
* Background: lush green trees with hints of golden autumn leaves, along with wooden resort structures.
The water naturally surrounds the lower body, forming a framing effect. Use a 50mm lens at f/2.0 to achieve shallow depth of field and subject emphasis.
[Lighting and Color] Lighting is natural golden-hour backlighting, coming from above and behind the subject at a 45-degree angle on the right. Effects include:
1. A golden rim light outlining hair and shoulders.
2. Steam particles creating visible volumetric light rays (Tyndall effect).
3. Sparkling highlights on the water surface.
4. Sunlight refracting through moving water, casting realistic caustic patterns onto skin and fabric.
Color grading follows Fujifilm “Nostalgic Negative” style: warm, nostalgic tones with soft glowing highlights, rich but natural greens, and warm, accurate skin tones. Avoid high contrast; maintain a soft, dreamy tonal range.
[Environment and Atmosphere] The setting is a luxurious outdoor hot spring surrounded by dense, lush vegetation. Multi-level wooden resort structures are visible in the background. Steam and mist rise from the water surface, catching the golden backlight to create a soft, ethereal glow. The water is exceptionally clear with a turquoise hue, revealing rocks beneath and allowing accurate caustic light patterns to be visible.
[Technical Constraints] Achieve ultra-photorealistic quality. Avoid any form of skin smoothing, plastic-like textures, murky water, overcast gray lighting, indoor pool appearance, oversaturated HDR effects, or illustrative/anime styles. Ensure physically accurate lighting behavior, realistic steam diffusion, and correct underwater caustic projection.

Using the uploaded real female subject as the visual baseline, generate a photorealistic portrait with a soft, intimate atmosphere. The image captures a quiet morning moment by the window, where the subject rests lazily in gentle natural light. Through minimal styling and the interaction of light and shadow, create a calm, elegant, and relaxed aesthetic expression.
Character Baseline and Styling
Baseline: Strictly follow the facial features, structure, and overall aura of the woman in the uploaded image. Present a recognizable face consistent with approximately a 25-year-old appearance.
Expression: Soft and serene, with a direct gaze that conveys calm introspection. Skin should appear hydrated with subtle visible pores and natural highlights on the cheekbones.
Physique: Slender body with natural skin tone; under soft lighting, fine skin texture should be visible on the arms and legs.
Hair: Voluminous light blonde wavy hair with curtain bangs; slightly tousled with a natural, lived-in texture, with a few strands catching the light.
Clothing:
Top: A lightweight white collared button-up shirt, worn open and relaxed with rolled-up sleeves.
Inner Layer: A simple white cotton tank top.
Bottom: White cotton shorts or relaxed trousers with a minimalist design.
Socks: Sheer white ankle socks.
Accessories: Optionally include a minimal, nearly invisible delicate ring.
Overall Style: Soft intimate portraiture with understated elegance.
Pose, Environment, and Lighting
Pose: Sitting gracefully on a cushioned windowsill, knees drawn toward the chest. One arm supports the body from behind, while the other rests naturally along the legs.
Environment: A bright, airy indoor window corner with white wooden frames. Through the window, a softly blurred outdoor scene is visible, featuring a classic wrought-iron Parisian-style balcony railing and gentle greenery.
Lighting: Natural morning light streaming through the window, creating a soft diffused glow, gentle shadows, and warm highlights.
Color Palette: Soft tones of milky white, warm beige, and natural skin tones, accented by subtle greens in the background.
Atmosphere: Dreamy, intimate, and tranquil.
Composition, Quality, and Style
Composition: Eye-level medium shot using a 50mm prime lens for natural proportions.
Technical: Aperture set to f/2.8 to achieve a soft, shallow depth of field.
Image Quality: Ultra-photorealistic, RAW-style rendering at 8K resolution with extremely high detail. Accurately capture the delicate texture and translucency of the shirt fabric, the softness of cotton materials, the sheen of hair, realistic skin texture, and the interplay of light and shadow beyond the window. The rendering should feature volumetric lighting, ray-traced reflections, ultra-realistic textures, a medium-format (Hasselblad-like) photographic look, and subtle film grain.
Style: Hyper-realistic, soft narrative, high-end intimate portrait photography.