
Reference Images:
Ultra-cute kawaii fantasy studio entirely made from crochet, yarn, or knitted material. Pastel, dreamy, playful knitted wonderland inspired by classic video games. Background: layered knitted elements including giant crocheted clouds, soft rounded hills, stars, rings, swirls, floating yarn pom-poms, stitched symbols. All plush, rounded, bubbly, fully knitted with premium yarn texture. Center: young woman [use uploaded face reference] sitting on giant crocheted chibi animal chair, rounded head, tiny stitched features, embroidered big kawaii eyes, stubby limbs. Pose: knees together slightly angled, feet inward, body leaning slightly forward, one arm hugging plush crocheted doll of the animal, other hand pinching its cheek or holding ear. Expression: playful, cute, soft smile, slightly puffed cheeks, bright sparkling eyes, innocent and cheerful. Outfit: slightly sexy yet cute, fully knitted cropped oversized pastel blue sweater, high-waist knitted mini skirt with soft pleats, chunky knitted socks, crocheted snea

Style Direction: Ultra-photorealistic cinematic lifestyle photography, blending Japanese fresh minimalism with Korean tonal aesthetics. Emphasize a sense of relaxation and emotional softness, with a clean, gentle visual style that combines fashion editorial storytelling with everyday warmth. Theme and Atmosphere: A quiet, lazy afternoon by the window, conveying calmness, softness, and intimacy. Use light and body language to express a “healing daily life” mood. Reference Baseline: Strictly use the uploaded subject as the sole identity reference. Accurately reproduce facial structure, proportions, skin tone, hairstyle, and all recognizable features. No stylization or alteration of the real face. Aspect Ratio and Framing: 4:5 vertical composition. Full-body or half-body framing depending on the reference, prioritizing upper body and leg interaction. Centered alignment with balanced visual weight. Subject Placement: The subject leans by the window in a relaxed posture, occupying the right two-thirds of the frame. The left side is left open to show a softly blurred outdoor street scene, creating an emotional contrast between interior and exterior. Interaction Design: Relaxed posture, hands naturally overlapped and resting between the legs. Back gently leaning against the wall, legs slightly bent, conveying a casual, lived-in interaction. Real Subject (Photorealistic Rendering) Facial Fidelity: 100% preservation of original facial structure, proportions, skin tone, and hair color. Skin texture must be realistic, with visible pores and natural light transitions, avoiding any artificial smoothing. Eyes appear soft and slightly unfocused, with a faint smile conveying relaxed ease. Expression: Calm, slightly distant, with a soft, dreamy gaze and naturally relaxed lips, expressing a gentle and healing presence. Pose: Based on the reference sitting pose — back lightly against the wall, legs slightly bent and overlapped, hands resting naturally on the legs, shoulders fully relaxed with no stiffness. Wardrobe Reconstruction: Inner Layer: White spaghetti-strap tank top with ribbed texture, closely fitting the skin with visible material detail. Outer Layer: Blue-and-white striped shirt, loose fit, sleeves slightly rolled, natural fabric folds, evenly spaced stripes. Bottom: Likely shorts or a short skirt (frame limited to legs), maintaining a casual homewear aesthetic. Material Rendering: Skin-contact softness of the tank top, cotton texture of the shirt, and gravity-driven folds must all be clearly visible and realistic. Lighting and Environment Lighting Setup: Side-back natural light simulating sunlight from the window on the left. Light grazes the face, shoulders, and clothing, creating soft highlights, while shadows transition naturally (e.g., neck, inner arms), enhancing depth and atmosphere. Background: Interior: Light gray matte wall paired with an off-white cushion (soft, voluminous, with natural folds under light). Exterior: Blurred street scene outside the window, with faint silhouettes of buildings and pedestrians in cooler tones, contrasting with the warm interior light. Environmental Details: Dark window frame (metal or wood texture), clean windowsill, realistic cushion volume and folds, minimal clutter to maintain a clean, minimalist home aesthetic. Output Style Image Quality: 8K resolution with RAW-level detail. Hair strands, skin pores, fabric textures, and light gradients must be rendered with extreme realism. Color Grading: Low-saturation soft tones dominated by off-white, light gray, and pale blue, with warm yellow highlights simulating afternoon sunlight. Overall palette should feel fresh, calm, and healing. Post-processing Look: Cinematic grading with preserved shadow detail, soft highlights, and moderate contrast to reinforce the relaxed emotional tone. Lens Language: 35mm prime lens, f/2.8 aperture, with slight background blur and natural depth transition, keeping the subject as the visual focus.

Using the uploaded real female subject as the visual baseline, generate a photorealistic portrait with a soft, intimate atmosphere. The image captures a quiet morning moment by the window, where the subject rests lazily in gentle natural light. Through minimal styling and the interaction of light and shadow, create a calm, elegant, and relaxed aesthetic expression. Character Baseline and Styling Baseline: Strictly follow the facial features, structure, and overall aura of the woman in the uploaded image. Present a recognizable face consistent with approximately a 25-year-old appearance. Expression: Soft and serene, with a direct gaze that conveys calm introspection. Skin should appear hydrated with subtle visible pores and natural highlights on the cheekbones. Physique: Slender body with natural skin tone; under soft lighting, fine skin texture should be visible on the arms and legs. Hair: Voluminous light blonde wavy hair with curtain bangs; slightly tousled with a natural, lived-in texture, with a few strands catching the light. Clothing: Top: A lightweight white collared button-up shirt, worn open and relaxed with rolled-up sleeves. Inner Layer: A simple white cotton tank top. Bottom: White cotton shorts or relaxed trousers with a minimalist design. Socks: Sheer white ankle socks. Accessories: Optionally include a minimal, nearly invisible delicate ring. Overall Style: Soft intimate portraiture with understated elegance. Pose, Environment, and Lighting Pose: Sitting gracefully on a cushioned windowsill, knees drawn toward the chest. One arm supports the body from behind, while the other rests naturally along the legs. Environment: A bright, airy indoor window corner with white wooden frames. Through the window, a softly blurred outdoor scene is visible, featuring a classic wrought-iron Parisian-style balcony railing and gentle greenery. Lighting: Natural morning light streaming through the window, creating soft diffused glow, gentle shadows, and warm highlights. Color Palette: Soft tones of milky white, warm beige, and natural skin tones, accented by subtle greens in the background. Atmosphere: Dreamy, intimate, and tranquil. Composition, Quality, and Style Composition: Eye-level medium shot using a 50mm prime lens for natural proportions. Technical: Aperture set to f/2.8 to achieve a soft, shallow depth of field. Image Quality: Ultra-photorealistic, RAW-style rendering at 8K resolution with extremely high detail. Accurately capture the delicate texture and translucency of the shirt fabric, the softness of cotton materials, the sheen of hair, realistic skin texture, and the interplay of light and shadow beyond the window. Rendering quality should meet standards of volumetric lighting, ray-traced reflections, ultra-realistic textures, medium-format (Hasselblad-like) photography, and subtle film grain. Style: Hyper-realistic, soft narrative, high-end intimate portrait photography.

Using the real female subject you uploaded as the visual baseline, generate an ultra-high-detail, photography-grade realistic fashion portrait. The image should focus on a dynamic low squatting pose, presenting a sensual yet high-fashion outfit against a pure white soft-lit background. The overall mood should blend sensuality, confidence, and avant-garde fashion, with detail rendered to an extreme 8K level. Character Baseline and Styling Baseline: Strictly follow the facial features, structure, and personal aura of the woman in the uploaded image. Her distinctive facial contours must be clearly recognizable. Expression: Sensual and confident, with slightly parted lips. Makeup should be a captivating nude glam style, emphasizing highlighted cheekbones, subtly smoky eye makeup, and glossy, hydrated lips. Hairstyle: Voluminous, softly curled long hair, styled into loose waves, side-parted and falling naturally over the shoulders. Clothing, Accessories, and Pose Outfit: Top: A minimalist black long-sleeve top with a wide neckline. Bottom: Paired with white underwear, layered with white lace tights covering the legs and feet; the lace should display a delicate semi-transparent texture. Footwear: A pair of black stiletto heeled sandals, featuring ankle straps and embellished toe straps with sparkling accents, worn over the lace tights. Accessories: Necklaces: A bold, sparkling rhinestone choker, paired with a long, slender Y-shaped necklace extending down toward the abdomen. Earrings: A pair of long dangling earrings. Pose: The subject is in a low squatting position. The right leg is deeply bent, while the left leg casually crosses over the right thigh. The right arm is extended to support the body on the ground, and the left hand rests loosely near the knee. The pose should convey strong balance, dynamic tension, and a sense of relaxed elegance. Environment and Lighting Environment: A clean white seamless studio background with soft diffusion, free of any distractions. Lighting: Bright, even studio lighting. The light should clearly define the body contours, the intricate textures of the clothing (especially the lace), and produce precise highlights and reflections on accessories such as rhinestones and metal, creating a clean, bright, and layered lighting effect. Composition and Image Quality Composition: Medium-to-close framing, focusing on the pose and upper body. The camera angle is slightly top-down to enhance the compositional tension and visual impact of the squatting pose. Technical: Photorealistic, ultra-detailed style. Sharp focus is required, accurately rendering all complex materials including skin texture, hair strands, lace patterns, jewelry facets, and the sheen of the footwear. Image Quality: 8K ultra-high resolution with extremely rich detail. The image must be clean and sharp, achieving the level of polish and artistic quality of top-tier fashion editorials. Mood and Style Overall Style: Sensual, high-end, avant-garde fashion editorial aesthetic. Atmosphere: Within a minimalist setting, use pose, lighting, and fine detail to convey a controlled sensual allure and an effortlessly rebellious fashion attitude.

MONUMENT: The Forbidden City Photorealistic hero product photography shot on Sony A7III with 85mm f/1.4 lens at f/2.8, soft natural daylight from upper left creating gentle realistic shadows and visible surface textures: A giant famous [MONUMENT] masterfully recreated as a delicious multi-tiered edible cake. Visible fluffy sponge cake layers, thick silky buttercream frosting, smooth fondant details, subtle edible gold leaf accents, realistic cake crumbs and ganache drips. The cake tiers ingeniously transformed into a luxurious miniature vibrant living city with tiny glowing windows, staircases, gardens and balconies built into the frosting and sponge. Miniature visitors in tiny colorful clothes walk the buttercream pathways, relax on cake terraces, sip tiny drinks and take photos. Intricate realistic cake textures mixed with whimsical architecture, physically plausible yet surreal. Shot for a luxury architecture magazine cover, ultra-detailed 8K, perfectly centered square composition, no text, no artifacts.

Using the uploaded real female subject as the visual baseline, generate an ultra-high-detail fashion portrait that blends balletcore aesthetics with Rococo-inspired fantasy. The scene is set in a classical interior, capturing an elegant moment of the subject wearing a pink-blue voluminous skirt, creating an ethereal, dreamlike, and haute couture visual narrative. Character Baseline and Styling * Baseline: Strictly follow the facial features, structure, and overall aura of the woman in the uploaded image. Preserve her distinctive identity and soft characteristics. * Expression: Soft and ethereal, with large, expressive eyes. Skin texture should be smooth with a natural luminous glow and subtle warm undertones. * Physique: Slender and elegant body, with realistic skin tones and soft highlights on the legs and upper chest. * Hair: Voluminous long blonde curls, appearing wind-swept. Hair strands should form a dynamic, airy halo effect around the head and shoulders. * Outfit: Top: A pale pink structured corset with front lace-up detailing and delicate ribbon-tied shoulder straps. Bottom: An ultra-short, highly voluminous multi-layered ballet tutu composed of gray-green and soft pink tulle layers. * Style: Romantic balletcore with a touch of Rococo whimsy. * Footwear: Minimalist white lace-up high heels with a refined ankle strap. Pose, Environment, and Lighting * Pose: Standing with the body slightly leaning against a white wall, ankles crossed to create an elegant silhouette. Arms placed behind the back, gently pushing the upper body forward. * Environment: A refined classical interior featuring white paneled double doors with decorative molding, and a polished light herringbone wood floor. * Lighting: Bright, diffused natural light entering from a side window, casting soft shadows and highlighting the delicate textures of tulle and lace. * Color Palette: Soft pastel tones including blush pink, muted gray-green, and clean gallery white. * Atmosphere: Dreamy, ethereal, with a high-fashion editorial mood. Composition, Quality, and Style * Composition: Eye-level full-body cinematic shot using a 50mm f/1.2 lens. * Technical: Aperture set to f/2.0 to achieve soft focus falloff. * Image Quality: Ultra-photorealistic, RAW-style rendering at 8K resolution with extreme detail. Precisely capture the transparency and layering of tulle, the satin sheen and texture of the corset, the curl and gloss of the hair, the fine details of skin, and the reflective quality of the wooden floor. Rendering quality should meet standards of volumetric lighting, ray-traced reflections, ultra-realistic textures, and medium-format (Hasselblad-like) photographic fidelity. * Style: Hyper-realistic, romantic narrative, haute couture fashion photography.

GLOBAL STYLE & QUALITY Ultra-high-end hyper-realistic fashion photography, cinematic editorial realism, natural optics, physically correct lighting, real skin micro-detail, zero stylization, zero CGI, zero illustration, no anime, no fantasy look, pure photographic dominance FACE Cold, controlled, dominant feminine face Sharp symmetrical features, porcelain skin with subtle warmth Eyes locked directly into the camera with predatory calm No smile, lips relaxed and closed — expression communicates “I am untouchable” Makeup minimal but sculpted: emphasis on eyes and bone structure BODY Elegant yet powerful feminine physique Strong posture despite relaxed pose Torso slightly forward, hips grounded — ownership of space Pose radiates confidence, not invitation Sex appeal comes from authority, not softness SKIN Fair luminous skin with realistic wet sheen Natural highlights from water on thighs, hips, and waist No artificial glow, no plastic smoothness Subtle contrast between dry upper skin and water-kissed lower body HAIR Long straight hair, dark roots transitioning to lighter ends Hair slightly damp near ends Clean parting, controlled flow No wind drama — calm, intentional presence CAMERA & ANGLE Medium-close to mid-body shot Camera positioned slightly below chest level Angle creates power imbalance in favor of subject Lens realism (85mm editorial look), shallow depth of field SETTING Ancient marble pool with pale stone architecture Classical columns and sculptural forms in background Bright daylight with soft diffusion Background clean, majestic, and secondary to subject SCENE SETUP Subject seated in shallow water One leg bent, one leg extended forward Waterline wrapping around thighs and hips Upper body upright, stable, unshaken She does not react to environment — environment submits to her presence OUTFIT (CRITICAL – IMAGE MATCH) White deep-V draped dress / swim-dress hybrid Fabric thin, flowing, soaked at lower half Deep plunging neckline emphasizing chest without exaggeration Dress clings naturally to body due to water weight Gold jewelry: • Elegant drop earrings • Fine layered gold necklaces • Gold arm cuff on upper arm No extra accessories, no modern clutter PROPS (FLATLAY FEEL) None — power is self-contained Jewelry acts as status markers, not decoration EXPRESSION & MOOD Aggressive elegance She is not seductive — she is commanding Mood feels calm, dangerous, superior Viewer feels observed, not invited Sexiness emerges from control and dominance ATMOSPHERE Bright but controlled daylight Soft shadows carve collarbones, shoulders, and thighs Water reflections subtly dancing on skin Clean exposure, no blown highlights FINAL RESULT GOAL A hyper-realistic, aggressively dominant BOLD portrait Feels like a luxury power-myth — goddess without fantasy No softness, no cuteness, no playfulness Pure authority, composure, and sexual confidence Indistinguishable from a high-budget editorial fashion photograph

Create a vibrant and artistic watercolor painting of a smiling person, using the uploaded photo as the facial reference without altering their facial features or appearance. The person is wearing a neutral-tone T-shirt. The background is clean white, filled with dynamic and expressive watercolor paint splashes in blue, orange, and purple spreading organically across the scene. In the foreground, a realistic human hand holds a paintbrush as if actively painting the sleeve of the man’s hoodie, creating the illusion that the portrait is being painted at that exact moment. The visual style combines detailed ink line art with loose watercolor textures, including paint drips, fine splatters, soft pigment diffusion, watercolor blooming effects, and natural paper texture. The image features strong contrast, an expressive and dynamic artistic look, a clear hand-painted aesthetic, centered composition, soft lighting, high level of detail, and a modern illustration style.

[Core Visual Concept] Ultra-high-definition professional live-action character design sheet. Based on the character features from the reference image, create an industrial-grade visual reference set intended for big-screen film production. The overall style must be locked to photorealistic live-action photography, strictly prohibiting any anime or stylized rendering, targeting 8K cinematic-level detail fidelity. [Three Core Sections of the Design Sheet] 1. Standard Turnaround Sheet View Specifications: Includes full-body standing views of the character from three angles: front, profile (side), and back. Visual Alignment: All views must maintain perfectly consistent proportions, ensuring character height, facial feature placement, and clothing folds align precisely across angles. Background Setting: Clean industrial gray or neutral background with subtle physical shadows to enhance depth. 2. Expression Sheet Emotion Matrix: Showcase six fundamental emotional states under realistic lighting conditions: joy, sadness, anger, surprise, fear, and neutral. Facial Realism: Focus on capturing realistic facial muscle movement, including natural forehead lines, compression around the eyes, and dynamic lip changes, while preserving skin texture and pore-level detail. 3. Action Pose Sheet Typical Actions: Depict the character in representative cinematic motions, including running, jumping, laughing, and crying. Gravity and Dynamics: Emphasize the physical behavior of clothing under motion, including natural draping and tension in fabric, as well as accurate muscle engagement, creating an authentic captured-in-motion feel. [Character Attributes and Texture Constraints] Identity Consistency: Strictly replicate the facial features, skin tone, hair texture, and body proportions from the reference image. Prohibited Styles: Absolutely no cel-shading, exaggerated anime proportions, or flat color blocking. Lighting Quality: Use neutral global illumination to ensure consistent color and visible detail across all angles, suitable for 3D modeling reference. [Execution Instructions and Output Specifications] Composition: Clean and organized multi-panel layout. Image Quality: Ultra-high resolution, RAW photographic texture, 8K UHD.