
Reference Images:
{ "type": "image_generation_prompt", "style": "photorealistic, Japanese cultural travel, natural daylight", "identity_preservation": { "use_reference_image": true, "alter_face": false, "strict_identity_lock": true }, "subject": { "gender": "female", "pose": { "stance": "standing along a traditional wooden street", "posture": "graceful, relaxed" } }, "environment": { "location": "Gion, Kyoto", "background": { "architecture": "wooden machiya houses", "atmosphere": "quiet, timeless, cultural" } }, "lighting": { "type": "natural daylight", "quality": "soft and diffused" } }

Using the uploaded real subject as the visual baseline, create an ultra-high-detail, photorealistic portrait with a soft, atmospheric mood. Aim for a warm vintage film aesthetic, shallow depth of field, and cinematic color grading in 4K resolution, evoking a relaxed, introspective, and slightly melancholic intimate atmosphere. Character Setup Baseline: Strictly base the subject on the uploaded real female, preserving her facial features, proportions, and expression. The body should appear slender with natural curves. Hair and Makeup: Long hair with a slightly tousled look. Makeup is natural and minimal, with soft, well-defined brows. Fingernails are painted in a dark color, and a small silver ring is worn on the left hand. Expression: Looking directly at the camera with a gentle, subtle smile. Styling and Pose Outfit: An oversized, thick chunky knit sweater with rich texture, blending pink, dark gray, and white yarns, worn as a short dress. Paired with thick light gray ribbed over-the-knee knitted socks. Pose: Sitting on a lightly patterned gray granite kitchen countertop. Knees are drawn tightly to the chest, with the right arm wrapped around the legs. The left hand holds a simple pink coffee cup near the shoulder. Scene and Lighting Environment: A cozy kitchen in soft daytime lighting. The background is blurred but recognizable, including white cabinets with round silver handles, a stainless steel stove with a range hood and oven, a large multi-pane window, and hanging cookware in the upper left. On the countertop behind her are a simple black cup, a red electric kettle, and a transparent glass bottle. Lighting: Soft, diffused natural daylight enters from behind and from the left-side window, casting gentle highlights on her face and the knit texture of the clothing. Color and Atmosphere Color Palette: Emphasize charcoal gray, pink, warm cream tones, black, and soft red, creating a warm, slightly faded vintage film aesthetic. Mood: Relaxed, introspective, with a subtle melancholic tone. Photography and Image Quality Composition: 3:4 vertical format, medium-to-wide full-body shot, eye-level perspective. Simulate a modern smartphone camera with shallow depth of field. Focus and Quality: Sharp focus on the subject’s face and eyes, with cinematic color grading and realistic detail rendering at 4K resolution.

Intelligent professional portrait headshot in the style of American ID photography. The subject size is balanced and natural within the frame. Use a subtle light gray to white gradient studio background. Lighting is soft and natural, highlighting realistic skin tones and gentle depth. The image should be clear and high-quality with sharp facial focus. Skin texture appears clean, luminous, and healthy. The head-to-shoulder proportion is comfortable and natural. The overall mood is modern and elegant. The subject has a relaxed, natural, and confident expression with bright, lively eyes. Medium portrait framing with the subject centered in the composition. Low contrast lighting to create a refined, professional studio portrait aesthetic. Suitable for business and professional profile photos. Aspect ratio: 3:4.

Using the uploaded real female subject as the visual baseline, generate an immersive, story-rich environmental portrait with a warm and comforting atmosphere. The scene is set by a sunlit window inside a secondhand bookstore, capturing a quiet afternoon moment shared between a young woman and a cat. The overall style should be cinematic, highly detailed, and photorealistic, using light, texture, and everyday elements to create a cozy, healing, and narrative-driven visual experience. Character Baseline and Styling Baseline: Strictly follow the facial features, structure, and youthful aura of the woman in the uploaded image. She should have long chestnut-brown hair. Expression: Calm and relaxed, immersed in the quiet rhythm of reading and rest. Hair: Long chestnut-brown hair, naturally falling. Clothing: A cream-colored oversized sweater, large enough to cover her hands, enhancing a sense of softness, comfort, and enclosure. Scene, Details, and Interaction Primary Setting: A sunlit secondhand bookstore. The subject is seated on a slightly dusty windowsill. Core Props and Interaction: Books: Vintage manga scattered beneath and around her. An open copy of “Nausicaä of the Valley of the Wind” lies beside her. Cat: A black-and-white tuxedo cat curled up asleep by her legs, with one paw resting gently on the open manga page. Subtle details such as the faint twitching of the cat’s whiskers during sleep should be captured. Action: The subject is sipping iced coffee from a ceramic cup, with visible vapor or condensation gently rising from the cup. Environmental Details: Outside the window is a soft baby-blue sky, with a distant clock tower suggesting midday through lighting cues. The aged yellowed pages of old books, the knit texture of the sweater, and the fine details of the cat’s fur must all be rendered with high clarity. Lighting, Composition, and Image Quality Lighting: Bright natural sunlight streams through the old bookstore window, casting patterned, lattice-like shadows across the subject’s face. Light is the dominant visual element, creating a warm, airy atmosphere that emphasizes a “comfortable afternoon” mood. Composition: Medium-close framing focused on the subject, the cat, and the windowsill area. The perspective should resemble an environmental portrait with strong narrative presence. Color and Tone: Warm overall color palette, soft lighting, delicate shadows, and naturally balanced saturation, highlighting a cozy and nostalgic feeling. Image Quality: 8K ultra-high resolution with extreme detail. Must clearly render volumetric light (Tyndall effect), sweater fibers, aged paper textures, cat fur, glass surface imperfections, and fine dust particles suspended in the air. Mood and Atmosphere Core Mood: Healing, tranquil, lazy, nostalgic, and rich with literary and everyday life ambiance. The image should convey a sense of time slowing down, with a perfect moment of solitude shared harmoniously between a person, her pet, and her books.

Using the real female subject you uploaded as the visual baseline, generate an ultra-high-precision photorealistic portrait that blends fantasy storytelling with high-fashion aesthetics. The character is reimagined as a modern Cupid, poised elegantly among the clouds, creating a visual spectacle that is ethereal, dramatic, and fashion-forward. Character Baseline and Styling Baseline: Strictly follow the facial contours, proportions, skin texture, and overall aura from the reference image. Age and ethnic characteristics must remain consistent with the original. Expression: Enhanced soft-glow makeup that remains faithful to her natural features. Expression is focused, with sharp, intent eyes suggesting precise aim. Hairstyle: Fully replicate the hair color, length, texture, and natural form from the reference image. Wardrobe, Accessories, and Pose Main Dress: A fantasy-inspired haute couture gown. Design: Structured, intricately detailed strapless bodice with a long, flowing train. Material: Constructed from layered, realistic soft pink cherry blossom petals, creating a light, semi-transparent, airy effect with clearly defined petal textures. Color: Dominated by soft sakura pink, blended with crimson and nude pink in a natural gradient. Wings: Large petal wings composed of countless cherry blossom petals arranged in a dimensional collage, transitioning naturally from pale pink to near-white. Each petal must exhibit realistic texture and light transmission. Footwear: Ultra-thin high-heeled decorative sandals, wrapped with vines and petals, accented with white feathers. Weapon Accessory: Holding an ornate gilded longbow with intricate engravings. The arrow tip is set with a heart-shaped gemstone, emitting a soft pink-gold glow. Pose and Composition Pose: The subject sits gracefully atop a fluffy cloud. The right leg is bent for support, while the left leg extends forward. The upper body leans slightly forward as she draws the bow, combining elegance with controlled strength. Composition: Medium shot with the subject centered in the frame, wings fully extended. Eye-level perspective. Sharp focus on the face and bow, with the surrounding clouds slightly softened. Environment and Lighting Environment: Suspended in a bright sky, seated on voluminous clouds. The background features a vivid blue sky with scattered floating cherry blossom petals, creating a dreamy, weightless atmosphere. Lighting: Soft directional diffused lighting. The key light from above at an angle defines the silhouette, complemented by fill light to maintain clean shadows. Light creates crystalline highlights on gemstones, metal surfaces, and petals, resulting in an airy yet dimensional tonal quality. Style and Texture Visual: Photography-grade hyperrealism with cinematic impact and extreme detail fidelity. Detail: Supports 8K-level rendering. All materials—skin, petals, metal—must achieve high physical realism. Fusion: Seamlessly combines high-fashion aesthetics, mythological drama, and romantic fantasy imagination. Quality: Achieves the clarity, color depth, and visual impact of top-tier commercial fashion campaigns.

A high-angle tilt-shift cinematic image of the character captured naturally doing an everyday activity. The moment must feel candid and unstaged, like a quiet slice of life observed from above. Likeness Preservation Maintain exact proportions, eye shape, spacing, stylization, and material identity from the reference. Do not humanize. Do not reinterpret anatomy. Do not exaggerate realism. Camera & Perspective High-angle, slightly offset bird’s-eye view. Subtle isometric feeling. Equivalent to a 35mm lens adapted for tilt-shift photography. The character fills approximately 60–65% of the frame vertically. Breathing room above the head. Visible ground plane beneath. No extreme distortion. Focus Discipline (CRITICAL) Extremely shallow depth of field. Only the face and upper torso must be razor sharp. Hands interacting with an object may also be sharp. Lower legs and shoes must be slightly softened. Ground surface must progressively soften toward the bottom edge. Top and bottom edges must show strong optical tilt-shift blur. The focus band must be narrow and centered horizontally. No full-body sharpness. Pose Behavior The character must be naturally engaged in a daily action. Movement must feel subtle and believable. No staged hero poses. No exaggerated symmetry. Silhouette must remain clean and readable. Props or minimal furniture may be included only if necessary to support the action. Keep them simple and secondary. Lighting Control Bright natural daylight with firm directional key light. Light must carve the form clearly. Defined shadow under feet or object. Crisp shadow edge on ground. Subtle rim light separating character from background. No flat studio wash. No overly diffused light. Background Discipline Abstract minimal environment. Soft tonal gradient. Slight atmospheric depth. No detailed storytelling clutter. No heavy furniture unless essential to action. Background must support the character, not compete. Background Directional Structure (CRITICAL) Upper third of frame must contain soft luminous bokeh shapes or circular light blobs, gently out of focus, slightly brighter than mid-ground. Mid-ground remains neutral and unobtrusive. Lower ground plane must have subtle diagonal orientation, with a light-to-dark gradient guiding the eye upward toward the character’s head. Ground texture may contain faint linear perspective cues, but remain soft and minimal. The tonal gradient must travel from slightly darker bottom edge toward brighter mid-upper region behind the head. No flat monotone background. Color & Material Character slightly higher saturation than background. Background slightly muted. Clean premium collectible finish. Subtle surface sheen. No gritty realism. No artificial wear. Emotion Calm presence. Quiet confidence. A simple daily act elevated through composition and light. Not dramatic. Not heroic. Intimate but cinematic. Negative Lock No full-body sharpness No flat background No cluttered environment No stiff symmetrical pose No soft diffused lighting No dramatic battle energy No exaggerated realism

Using the uploaded real subject as the visual baseline, create a full-body portrait with an independent, rebellious aesthetic and a vintage film texture. Embrace a 1990s point-and-shoot camera style, using direct flash, high-contrast lighting, retro color grading, and casual composition to convey a raw, authentic, and edgy “cool girl” atmosphere. Character Setup (Based on the real uploaded subject) Baseline: Use the real female subject as the foundation, preserving her facial features and body proportions. Hair: Thick, slightly messy long hair (color matching the uploaded subject). Outfit: Sporty black cropped top with white trim, paired with matching black shorts and white over-the-knee socks. Accessories: A silver bracelet worn on the wrist. Pose and Composition Pose: Sitting casually on an elevated dark wooden surface (such as a counter or piano top). The body leans forward slightly, relaxed but with a sharp edge. One hand is raised near the mouth, lightly biting a finger. The gaze is directed straight at the camera, with a sensual yet nonchalant expression. Composition: Full-body shot using a 35mm focal length, slightly low-angle perspective to emphasize leg lines. Medium-distance framing, appearing spontaneous yet visually balanced. Sharp focus on the subject, with the background falling off due to the rapid light decay of direct flash. Scene and Details Background: A white wall decorated with vintage posters taped onto it, along with brown wooden blinds. Foreground and Props: Include visible elements such as the neck of an electric guitar, a tin of cookies, and several bottles to enhance the lived-in, candid atmosphere. Lighting and Color Lighting: Simulate direct on-camera flash with hard lighting. The light should be harsh and frontal, casting sharp, well-defined shadows on the wall behind the subject. High contrast, with no soft lighting or diffusion. Color and Texture: Vintage film aesthetic, emulating Kodak Gold 200 tones. Warm hues from wooden blinds and furniture contrast with the cool white flash. Colors should be deep and slightly muted, with rich blacks and subtle film grain, creating a lo-fi indie visual style. Overall Mood and Style Style: Independent, rebellious “cool girl” aesthetic. Raw, authentic, with a Y2K-inspired retro fashion editorial vibe, resembling a snapshot. Mood: Casual, direct, with a hint of playful defiance. Technical Specifications and Constraints Image Quality: Photorealistic, but intentionally incorporating vintage film imperfections and direct-flash artifacts. Avoid: Soft lighting, studio lighting setups, shallow depth-of-field bokeh effects, polished studio portraits, skin smoothing, 3D-rendered looks, cartoon or illustration styles, distorted hands, missing guitar strings, floating objects, anatomical inaccuracies, stiff posing, excessive post-processing, and HDR effects.

[Core Concept] Style Direction: Ultra-photorealistic modern sports photography in a minimalist studio setting, blending professional athletic aesthetics with high-fashion editorial sensibility. Theme Atmosphere: Focused, calm, and extended—capturing the tension and balance of the body in motion. Reference Baseline: Strictly use the uploaded subject as the foundation, accurately reproducing facial features, structure, body proportions, and personal aura. [Composition and Framing] Aspect Ratio: 3:4 vertical composition. Camera Angle: Medium full-body seated shot, eye-level to slightly top-down perspective. Scene Structure: Subject centered in the frame, with a seamless light gray/off-white background and floor forming a clean, minimal environment. Depth of Field: Subject in sharp focus, background softly blurred to emphasize the figure and movement details. [Emotion and Narrative] Core Emotion: Focus, calmness, inner strength. Visual Narrative: Through precise posture, minimal surroundings, and soft lighting, convey a moment of “self-dialogue within motion.” [Character and Styling] Facial Features: Faithfully reproduce the subject’s facial structure and expression—calm and focused, gaze naturally directed downward toward the extended foot. Hair and Makeup: Loose high bun with soft wavy strands falling along both sides of the face. Makeup is light and natural, emphasizing authentic skin texture. Outfit: * Olive green two-piece athletic set: deep V-neck sports top and high-waisted fitted leggings. * Fabric closely follows the body, highlighting athletic curves. Accessories: * White ribbed mid-calf socks with black “NEVER LOSE” text at the ankle. * Retro chunky white sneakers (New Balance style), with visible rubber outsole texture. * Metallic silver long nails. * Small chunky gold hoop earrings, a fine gold chain with a dark pendant, and a short pearl necklace. [Pose and Movement] Core Pose: Seated forward stretch. * Right leg fully extended forward, heel on the ground, toes pointing upward. * Right hand reaches forward, gripping the front of the right shoe. * Left leg bent inward, knee touching the ground. * Left hand rests lightly on the left thigh. * Torso leans forward into the stretch. Body Expression: Demonstrates flexibility, control, and strength, with naturally flowing muscle lines. [Lighting and Environment] Setting: Minimalist studio with seamless infinity background in light gray or off-white. Lighting: Soft, even diffused studio lighting. * Subtle highlights on collarbones, shoulders, and hair. * Soft, light shadows cast on the right side and behind the subject on the floor. * Overall lighting remains natural and avoids harsh contrasts. Color Palette: Olive green, white, light gray, beige, silver, and gold—forming a refined and harmonious tonal system. [Technical Specifications] Lens Simulation: Portrait lens with f/2.8–f/4.0 aperture. Image Quality: 8K resolution with ultra-high detail. Rendering Requirements: Skin texture, subtle sweat sheen, individual hair strands, fabric fibers, and accessory materials must be clearly visible. Visual Style: Professional-grade photography—clean, sharp, tonally cohesive, with the aesthetic quality of a premium sportswear advertising campaign.

Core Concept Using the uploaded real female subject as the visual baseline, generate a high-fashion editorial image infused with Y2K retro aesthetics and strong visual contrast. The scene captures the subject in a summer outdoor setting, interacting with a pink vintage convertible covered in decorative foam, creating a lively, nostalgic, and fashion-forward visual narrative. Character Baseline and Styling Baseline: Strictly follow the facial features, structure, and personal aura of the subject in the uploaded image. Expression: Confident with a playful edge. Skin: Naturally radiant with a healthy glow. Physique: Well-proportioned and fit, with a natural forward-leaning posture. Hair: Long, dynamic wavy hair, gently lifted by a light breeze for a sense of motion. Outfit: Bright pink two-piece casual set paired with matching stylish sandals. Accessories: Fashion sunglasses, a delicate necklace, and minimal rings. Pose, Scene, and Lighting Pose: The subject leans naturally against the car, with both hands lightly touching the foam-covered surface. Scene: Bright outdoor setting featuring a pink vintage convertible coated in decorative foam as the central prop. Environment: Slightly wet ground reflecting ambient daylight, creating natural reflective highlights. Lighting: Bright natural sunlight producing balanced highlights and shadows, enhancing contrast and dimensionality. Color Palette: Dominant pink tones complemented by blue sky and white foam, forming a harmonious and vibrant color scheme. Atmosphere: Playful, nostalgic, and stylish. Composition, Quality, and Style Composition: Eye-level perspective with a slightly wide-angle feel. Technical Settings: Aperture at f/4.0 to maintain clarity across the frame. Image Quality: High-detail photorealism with strong emphasis on material textures and light behavior. Visual Style: Y2K-inspired retro fashion with cinematic storytelling.