
Reference Images:
{ "meta": { "project": "Ski_Gondola_Egirl_Flux_V4.2", "target_engine": "Flux.1 [dev] / Nano Banana Pro", "version": "4.2.0 (Everything in Focus - f/11)", "created_at": "2025-12-18T15:35:00Z" }, "engine_configuration": { "model": { "base": "flux1-dev.safetensors", "quantization": "fp8 / nf4", "vae": "ae.safetensors" }, "lora_slots": [ { "name": "Realism_LoRA_v2 (Optional)", "strength": 0.5, "note": "Enhances porcelain skin tone, nylon textures, and snow reflections." } ], "sampling": { "sampler_name": "euler", "scheduler": "simple", "steps": 28, "guidance_scale": 2.5, "shift": 1.0 }, "dimensions": { "width": 1024, "height": 1536, "aspect_ratio": "2:3", "megapixel_class": "1.5MP" } }, "prompt_construction": { "narrative_layer": { "style": "Winter Lifestyle / Travel Photography", "instruction": "Capture a sharp, high-contrast shot inside a ski gondola, balancing the interior subject with the bright snowy mountain view outside.", "subject_flow": "A pale young woman with black wolf-cut hair wearing a white puffer jacket sitting in a cable car, touching her hair." }, "texture_layer": { "skin_physics": "pale porcelain skin, glossy lips, dramatic e-girl eyeliner, smooth finish", "fabric_physics": "shiny nylon texture of white puffer jacket, technical matte fabric of black ski pants, reflective lens of ski goggles", "environment_physics": "SHARP DETAILS ON BACKGROUND: clear glass window, white snow texture on mountains, dark green pine trees, blue sky" }, "camera_physics": { "lens_imperfections": "high contrast, sharp daylight, slight reflection on glass", "focus": "DEEP DEPTH OF FIELD (f/11) - NO BLUR. The woman, the gondola interior, and the distant snowy mountains are all sharp.", "settings": "Sony A7R V, 35mm Lens, 1/1000s, ISO 100 (Bright Snow Daylight)" }, "color_grading": { "white_balance": "Cool Daylight (Blue Sky/White Snow dominance)", "shadows": "Deep, defined shadows inside the cabin", "highlights": "Bright, crisp highlights on snow and jacket" } }, "final_prompt_string": "A candid raw lifestyle photograph shot on Sony A7R V 35mm f/11. Deep depth of field, everything in focus. A young woman (19-25) with pale porcelain skin and shoulder-length black hair with bangs (wolf cut) sitting inside a ski gondola. She wears a shiny white cropped puffer jacket, black ski pants, and black ski goggles on her head. She touches her hair behind her ear and looks at the camera with a calm expression. Dramatic e-girl makeup with winged eyeliner and glossy lips. Bright winter sunlight illuminates her face. Background is sharp and detailed: Through the large glass window, a panoramic view of snowy Alpine mountains, ski tracks, pine trees, and a clear blue sky is clearly visible with no bokeh. High contrast. Winter travel aesthetic.", "negative_prompt_string": "", "note_on_negative": "Flux ignores explicit negative prompts. Sharpness is ensured by positive descriptors like 'f/11' and 'deep depth of field'.", "post_processing": { "upscale": { "enabled": true, "method": "Magnific_AI_Style (Creativity: 1)" }, "face_restoration": { "enabled": false, "warning": "CRITICAL: DISABLE FACE RESTORATION." } } }

Use the woman's exact face without altering it. Black-and-white surreal editorial portrait: clean profile. Hand at chin in an elegant gesture. She wears a black textured voluminous top with a fur/feather effect, her hair is loose, and round earrings catch the light. Subtle horizontal motion blur runs across the top of the frame. Soft, diffused, high-contrast light: the face is sharp, the rest of the frame is cast into shadow. The background is minimalist, white with gray transitions. Close-up angle, shallow depth of field, glossy fashion style, slight grain. Negative: anatomy distortions, extra hands, incorrect lighting, lack of motion blur, cartoonish look, low contrast, overexposure, artifacts.

{ "subject_and_pose": { "description": "A photorealistic, highly detailed image of a beautiful young woman with shoulder-length hair featuring soft bangs. She has fair skin with a noticeable glossy, wet sheen, suggesting recent swimming activity. Her facial expression is calm and engaging, looking directly at the camera.", "pose": "She is kneeling on the tiled edge of a swimming pool, leaning slightly forward with her right hand resting flat on the pool deck for support and her left arm relaxed by her side. Her posture is poised and highlights an athletic full hourglass figure." }, "outfit": { "description": "She is wearing a tight, glossy one-piece competitive swimsuit.", "details": "The swimsuit is two-toned, featuring a deep royal blue center panel and vibrant bubblegum pink side panels that contour the body. The material has a high-shine, latex-like or wet lycra finish. A small pink logo is visible on the upper chest." }, "environment": { "setting": "An upscale outdoor resort pool area on a sunny day.", "background": "The immediate background features the sparkling blue water of the swimming pool with blue mosaic tiles visible beneath the surface. Further back, there is a poolside patio with white lounge chairs, lush green hedges, palm trees, and a villa-style structure with a wooden pergola roof, evoking a tropical summer atmosphere." }, "lighting_and_shot": { "lighting": "Bright, high-key natural sunlight. Strong specular highlights are present on the subject's wet skin and the glossy swimsuit, enhancing the texture and realism. The lighting creates defined but soft shadows, typical of a midday sun.", "camera_shot": "Medium-full shot, capturing the subject from the knees up. The angle is slightly elevated, looking down at the subject against the backdrop of the pool. The image is sharp and high-resolution with a shallow depth of field that keeps the subject in crisp focus while slightly blurring the background elements. Aspect Ratio 3:4." } }

Create a miniature 3D isometric diorama showing the invention of [Light Bulb] at the moment of [Edison successfully testing the first long-lasting carbon filament bulb]. Camera angle around 40° from above. Textures feel soft and polished. Materials follow realistic PBR rules. Lighting feels natural and balanced. The raised base includes tools, workshop elements, notes, and early prototypes. Tiny stylized inventors interact with objects. Faces are visible and recognizable with clean shapes and expressions. Background stays solid [warm dark navy blue]. Top center text shows [Light Bulb] in bold. Second line shows [Thomas Edison — 1879]. A simple line icon of the invention sits below. Text color adapts to background contrast.

I. Subject and mood objectives 1. Subject: young woman, medium-shot portrait; upper body dominates the frame, looking directly into the camera 2. Mood: sweet yet cool and playful, not childish; with a touch of confidence and spirited elegance 3. Expression and action: left eye winking, right eye open and expressive; subtle smile with slightly upturned lips 4. Gaze relationship: eyes sharply focused on the camera, creating strong interaction II. Composition and camera language 1. Shot size: medium shot, framing from the top of the head to the upper chest; face and hands visible in the same frame 2. Composition: centered composition, face centered slightly higher than mid-frame; arms positioned below to form support and enclosure 3. Camera angle: eye-level with a very slight top-down angle 4. Focal length: medium-to-long focal length for a half-body portrait look, with compressed background 5. Depth of field: * Focus point: eyes as the sharpest focus * Sharp zone: from both eyes to the tip of the nose * Softened zone: noticeable softening from the mouth to the ears and back hair * Hands: slightly softer than the face; knit texture visible but not crisp * Transition: smooth and gradual transition from sharp to blur, no hard cutoff * Quantified target: overall depth of field approximately 1–3 cm, enhancing layering without overdoing it 6. Aspect ratio (must be followed): vertical orientation; clear background space on both left and right sides; all limbs must remain within the frame III. Lighting and background 1. Light source: softbox-style diffused soft light; key light from the front and slightly above, with very light shadows 2. Fill light: gentle frontal fill to reduce nose and nasolabial shadows; overall high brightness 3. Contrast: low contrast with smooth tonal transitions; skin highlights delicate and not overexposed 4. Background: clean, light-colored background (slightly warm white), no visible texture or props 5. Color temperature: warm white light, overall creamy warm tone IV. Facial features and makeup details 1. Skin: clear and translucent with fine texture preserved; natural highlights on the bridge of the nose and cheekbones 2. Features: small freckles scattered across the cheeks and nose 3. Blush: large-area diffused blush from the apples of the cheeks to under the eyes; warm pink-apricot tone 4. Eye makeup: clean and transparent; long, lifted eyelashes; lower lashes clearly visible 5. Eyes: sharp and clear with clean sclera; overall natural and translucent appearance 6. Lip makeup: glossy, jelly-like lip glaze with highlights; color between warm rose-bean and peach-red, with soft edges V. Hairstyle and hair accessories (key replication) 1. Hairstyle: high-position, voluminous, slightly messy bun or updo, with flyaways and outward-flipped strands at the crown 2. Bangs: light, wispy bangs with a natural part; a few strands resting on the forehead and softly floating 3. Sides: loose strands falling on both sides to create a casual, airy feel 4. Hair accessories: multiple light-colored hair clips and small ornaments (flowers, bows, cream-toned mini clips) 5. Hair color: deep brown with a subtle warm sheen VI. Clothing and hand styling (key replication) 1. Top: dark sleeveless dress or tank-style top, with brown plaid details or straps 2. Gloves: beige-apricot knitted fingerless long hand warmers; thick texture with clearly visible knit pattern 3. Accessories: brown plaid bow decorations on the hand warmers; thin gold bracelet on the wrist 4. Hand gestures: playful, cute, and spontaneous hand poses 5. Nails: long transparent extension nails with teal-green French tips or accents, featuring small rhinestone sparkle details VII. Color and texture style 1. Primary colors: creamy white background, warm brown hair, beige-apricot knit, brown plaid accents 2. Texture quality: soft focus without muddiness; skin smooth like cream; knit and plaid textures clearly defined 3. Image processing: light skin smoothing and brightening while preserving freckles; overall fresh, clean, and translucent look

Create a highly detailed isometric 3D interior diorama of a (Japanese wabi-sabi) (traditional tea room) Design the space with coherent architecture, furniture, lighting, materials, and decoration that naturally match the chosen concept. Use realistic textures, soft natural daylight, gentle shadows, and a calm minimal white background. Perspective: isometric cutaway diorama. Ultra high detail, premium interior visualization, soft global illumination.

[INPUT IMAGE: USER_PHOTO] Use the person in the input image as the ONLY subject. Preserve their identity and facial features clearly. Create a hyper-realistic high-fashion editorial photo inside a surreal 3D geometric “color box” room (a hollow cube / tilted cube set). Each render MUST randomly choose: 1) a bold single-color box (monochrome environment, vivid and saturated), 2) a dynamic “cool” fashion pose (gravity-defying or extreme stretch / leap / sideways bracing against the walls), 3) a dramatic camera angle (wide-angle 24–35mm equivalent, tilted horizon, strong perspective). The subject appears full-body and sharp, wearing an avant-garde fashion styling that feels modern and editorial (clean silhouette, stylish layering, premium fabric texture). Keep clothing tasteful and fashion-forward. The subject’s pose should feel athletic, stylish, and unusual—like a magazine campaign shot. Lighting: studio quality, crisp and cinematic; strong key light with controlled soft shadows, subtle rim light; realistic reflections and bounce light from the colored walls. Ultra-detailed skin texture, natural pores, realistic fabric weave, clean edges, high dynamic range. Composition: subject centered with plenty of negative space and strong geometric lines; the box perspective frames the subject. Color: the box color is a SINGLE bold color and MUST be different each run (random vivid hue). The subject’s outfit contrasts well with the box color. Output: hyper-real, photorealistic, 8k detail, editorial campaign quality, sharp focus on subject, no motion blur, no distortion of face, natural proportions.

Photorealistic edit using the input person photo as strict identity reference: keep the same face, facial features, facial proportions, skin tone, hairstyle (color/bangs/length/volume), outfit (design/material/layers), and accessories unchanged (no face swap, no new person, no hair/outfit/accessory change). Only transfer the target pose/gesture: seated on a chair/sofa, pelvis anchored on the seat, torso slightly reclined backward ~15–25°; shoulders mostly facing camera with a slight right-shoulder lift. Legs: confident leg-cross (one leg crossed over the other at the knees), the top leg is the foreground leg closest to the lens; foreground thigh/knee strongly foreshortened and oversized, thigh angled upward toward camera ~30–45°, knee raised high; lower leg drops toward the bottom of frame; feet not visible. The supporting leg stays lower and farther from the lens, knee angled outward ~20–35°, partially occluded by the top thigh. Right arm: elbow bent ~90° and lifted; forearm near-vertical beside the face; right hand near right cheek/temple with palm facing inward (toward the face), back of hand facing camera; fingers in a relaxed curl with index finger slightly extended; light fingertip/knuckle contact on cheekbone/temple with subtle contact shadow (no floating). Left arm relaxed downward; left hand rests on the top thigh near mid-thigh, palm-down with fingers straight/slightly splayed, visible contact shadow and slight pressure indentation. Occlusion must match: foreground thigh occludes pelvis and most of the far leg; right hand partially occludes jawline/cheek edge ~10–20%. Camera/framing locked: low-angle from below knee height looking upward ~10–20° (NOT front-high, NOT overhead-vertical), wide-angle look 18–24mm, close distance; exaggerated perspective with legs/thighs much larger than head; 2/3 body framing, crop at mid-shin/below knees, no feet visible; size ratio guide: foreground thigh occupies ~45–55% frame width, head ~18–25% frame height. Expression add-on (keep identity): chin slightly raised, half-lidded eyes, subtle smirk, arrogant queen-like gaze down toward camera. Clean realistic skin texture, no glam retouch; solid seamless backdrop, flat and textureless,pure light powder blue (approx #B0E0E6).; no new props.Negative Prompt: identity drift, face swap, different person, altered facial proportions, altered facial features, altered skin tone, altered hairstyle, altered hair color, altered bangs, altered outfit, altered accessories, new jewelry, heavy makeup, beauty retouching, plastic skin, waxy skin, wrong pose, uncrossed legs, wrong leg-cross direction, knees not crossed, wrong hip flexion, wrong knee angles, wrong limb direction, wrong body plane, wrong torso recline, wrong torso rotation, wrong weight placement, impossible anatomy, dislocated shoulder, broken elbow, broken wrist, twisted wrist, incorrect hand gesture, wrong palm orientation, floating hands, missing contact shadow, wrong contact point, extra fingers, missing fingers, fused fingers, deformed hands, incorrect occlusion, wrong crop, feet visible, cut-off hands, bad perspective, inconsistent foreshortening, overhead-vertical camera, front-high camera, low-res, text, watermark, logo, background clutter, extra props, added objects

Input A = Console games <instruction> Input A is a Core Technological Function (e.g., Computing, Navigation, Music, Communication). Deconstruct the function into 4 Chronological Artifacts: The Relic (Ancient): Identify the stone or bronze age precursor. (e.g., Computing -> Antikythera Mechanism/Abacus. Navigation -> Lodestone/Sundial). The Instrument (Classical): Identify the brass and wood scientific instrument. (e.g., Computing -> Pascaline. Navigation -> Sextant/Astrolabe). The Machine (Industrial): Identify the steam/steel era mechanism. (e.g., Computing -> Difference Engine/Punch Card Machine. Navigation -> Gyrocompass). The Device (Modern): Identify the current digital interface. (e.g., Computing -> Tablet/Smartphone. Navigation -> GPS Unit). 2. Container Goal: "Tech History" Workbench Photography. Setting: A cluttered Green Cutting Mat on a sturdy wooden workshop table. Background: A computer monitor displays the CAD Wireframe/3D Scan of the "Relic" or "Instrument," implying a restoration or study project. Tools: Calipers, X-Acto knives, and screwdrivers are scattered around the objects. 3. Lineup (The Artifacts): Arrangement: The four objects are arranged left-to-right (Oldest to Newest). Object 1 (Relic): Heavily damaged, calcified, or encased in rock. Looks like an archaeological find. Object 2 (Instrument): Polished Brass and Mahogany. intricate gears and lenses. Object 3 (Machine): Cast Iron, heavy bolts, punch cards, or paper tape. Object 4 (Device): Sleek glass and aluminum. The screen is cracked or the casing is transparent, revealing the circuitry inside. 4. Blueprint (The Connection): The Drawing: A large, coffee-stained Schematic Grid Paper lies flat under the objects. The Content: Hand-drawn diagrams connect the mechanical logic of the Ancient Relic to the circuit logic of the Modern Device. 5. Lighting & Atmosphere: Lighting: Mixed Lighting. A warm, articulated desk lamp illuminates the physical objects, while the cool blue light from the monitor washes over the back wall. Vibe: Intellectual, inquisitive, and tactile. Output: ONE image, 1:1 Aspect Ratio, 3D Render, "Maker Space" aesthetic, High Texture Fidelity. </instruction>