
Reference Images:
A cinematic 8-photo storytelling collage featuring the same girl throughout the journey. Frame 1: Early morning at home — the girl steps out of her house holding her bicycle, calm and motivated. Frame 2: She cycles through quiet streets, soft morning light, motion blur on wheels. Frame 3: Arrival at the gym — she parks the cycle outside the gym entrance, focused expression. Frame 4: Inside the gym — light warm-up stretches, gym equipment visible in background. Frame 5: Intense workout moment — lifting weights or using machines, sweat and effort visible. Frame 6: Post-workout relief — she drinks a protein shake, relaxed and satisfied. Frame 7: Leaving the gym — gym bag on shoulder, confident walk out. Frame 8: Cycling back home during golden hour, peaceful and accomplished mood. Consistent outfit progression (Gemini casual cycling wear to gym wear), realistic body movement, cinematic lighting, shallow depth of field, natural colors, lifestyle fitness photography style, editorial storytelling layout, ultra-realistic detail, 1:1 aspect ratio.

{ "visual_style_analysis_v2": { "style_definition": { "name": "Industrial X-Ray Transparency Render", "core_aesthetic": [ "Minimalist", "Futuristic", "Clinical Precision", "Structural Functionalism" ], "visual_logic": "Exploded view aesthetics within a closed shell" }, "color_system": { "palette_type": "Monochromatic Grayscale with Metallic Accents", "background": "#F5F5F5 to #FFFFFF (Clean Studio White)", "foreground": { "outer_shell": "Frosted Translucent White/Grey", "inner_components_primary": "Matte Black/Dark Graphite", "inner_components_secondary": "Metallic Silver/Copper", "highlights": "Soft Specular Highlights" } }, "internal_component_architecture": { "description": "Hierarchy of internal parts to create realistic density", "layer_1_structure_skeleton": { "elements": "Injection molding ribs, screw bosses, mounting brackets", "visual_role": "Defines the shape and adds 'ghostly' lines inside the shell" }, "layer_2_power_drive": { "elements": "Cylindrical/Pouch Batteries, Copper Coils, Transformers, Magnets", "visual_role": "Provides visual weight and dense black masses" }, "layer_3_logic_control": { "elements": "Printed Circuit Boards (PCB), Solder points, Silicon Chips, Ribbon Cables", "visual_role": "Adds intricate, fine detail and technical texture" }, "layer_4_interface": { "elements": "Tactile switches, USB-C ports, LED indicators, haptic motors", "visual_role": "Connects the inside to the outside surface" } }, "lighting_and_atmosphere": { "lighting_type": "Backlit Clinical Studio", "characteristics": [ "Transmissive Glow", "Rim Lighting to define edges", "Soft Ambient Occlusion inside" ] }, "materials_and_textures": { "shell_material": "Frosted Polycarbonate (PC) with 85% opacity", "internal_finish": "Matte vs Glossy contrast (components are glossy, shell is matte)", "x_ray_effect": "Depth-based opacity (thicker parts look darker)" }, "demonstration_example": { "product_type": "Digital Mirrorless Camera Body", "prompt_application": "Show stacked optical glass elements inside lens barrel, square image sensor chip, vertical shutter curtain mechanism, dense battery block in grip, and flexible PCBs under top dials" }, "ai_prompt_keywords_suggestion": [ "X-ray render", "translucent frosted shell", "visible internal engineering", "layered internal components", "complex circuitry", "copper coils and batteries", "structural ribbing", "C4D octane render", "industrial exploded view", "schematic realism" ] } }

{ "prompt": "Hyper-realistic black and white portrait of the person in the reference image, with strict preservation of facial features, structure, proportions, and fine details. The identity from the uploaded image must remain unchanged. Natural skin texture with visible pores, fine lines, subtle imperfections, and realistic micro-details. Neutral, serious expression with direct eye contact. Hair texture, length, and style exactly as shown in the uploaded face reference, with clearly defined individual strands. Facial hair only if present in the uploaded face reference, preserved accurately with no addition or removal. No gender assumptions or alterations.\n\nAttire is strictly locked and must not change: a rough-textured fabric scarf wrapped around the neck with visible fibers and slightly frayed edges, paired with a simple rugged outer garment. This attire is mandatory and must remain identical regardless of the uploaded image. Do not adapt clothing from the uploaded subject. No wardrobe substitution or variation.\n\nDramatic low-key lighting with strong directional light from the front-left, creating deep shadows and cinematic contrast. Timeless editorial portrait style. Ultra-sharp focus on the face, shallow depth of field, soft background falloff.", "technical_constraints": { "realism": "hyper-realistic", "detail_level": "ultra-detailed skin texture, micro-details preserved, 8K clarity", "face_preservation": "strict identity lock, exact facial feature retention, no morphing, no reshaping", "attire_lock": "fixed scarf and rugged outer garment, no clothing changes allowed", "camera_angle": "fixed straight-on head-and-shoulders portrait", "lens": "fixed 85mm prime lens", "aspect_ratio": "2:3", "depth_of_field": "shallow depth of field", "lighting": "fixed dramatic low-key lighting, directional front-left light", "dynamic_range": "high dynamic range", "style": "photorealistic, cinematic, film-grade contrast", "retouching": "none, no smoothing, natural imperfections preserved" }, "negative_prompt": "clothing change, wardrobe substitution, subject outfit adaptation, face alteration, identity drift, facial reshaping, feature exaggeration, gender bias, masculine bias, feminine bias, beauty retouching, plastic skin, smooth skin, stylized look, illustration, painting, CGI, blur, low resolution, asymmetry, distorted eyes, distorted mouth, extra facial features" }

A portrait of a shaman in a futuristic sci-fi city. They wear traditional tribal robes mixed with fiber-optic cables and circuit boards. They hold a staff topped with a hologram. The background is a rainy neon street.

A conceptual studio portrait featuring a person leaning forward with their elbows on an old retro TV set, hands resting against their cheeks. Inside the TV screen appears a second synchronized portrait of the same person adjusting sleek black sunglasses. Background is a deep burgundy backdrop. The person wears a black suit jacket with a crisp white shirt, styled in a refined, editorial manner. Symmetrical composition, moody soft lighting, high-fashion artistic aesthetic, ultra-high resolution.

Generate an image of the person from the uploaded reference image, keeping the face and hairstyle exactly as in the uploaded image. Dress the person in the clothing from the uploaded clothing image, preserving the style, color, and details of the garment. Pose the person in a cute and natural pose, looking towards the camera. Realistic photo style, natural lighting, high resolution. Background optional, neutral or "Tokyo street" setting.

City A: Shanghai City B: Beijing --- Generate a photorealistic, isometric macro render of a tectonic city clash diorama. The image must represent a physical collision between the two input cities using the following dynamic logic: 1. Semantic Urban Analysis: Analyze the Input Variable. Assign City A to the LEFT PLATE and City B to the RIGHT PLATE. Architectural DNA: Deconstruct the architectural vernacular of each city. Left Materiality: Construct the Left side using the primary building materials of City A (e.g., if Paris use limestone and zinc; if Tokyo use neon glass and steel; if Venice use brick and water). Right Materiality: Construct the Right side using the contrasting materials of City B. Verticality Logic: Respect the density and height of the cities. (e.g., Skyscrapers must tower over suburban sprawl). 2. The Container (The Tectonic Slabs): The Base: 2 massive, floating chunks of asphalt and bedrock suspended in a void. The Fault Line: a jagged, glowing crack separates the two plates diagonally. The crack glows with the color of the city's energy (e.g., Sublayer Magma, Subway Grates, or Sewer Steam). Connection: a single, precarious bridge representing the "cultural link" connects the two sides across the fault line. 3. Composition: Do not render the whole city. Render one dense "Super-Block" on each side that acts as a caricature of the city's soul. Landmarks: Embed one iconic structure per side, but scale it down to fit the block (e.g., The Eiffel Tower integrated into a street corner, or the Empire State Building rising from a cluster of brownstones). The Street Level: Populate the streets with micro-details specific to the city (e.g., Yellow Taxis vs. Black Cabs; Cherry Blossoms vs. Palm Trees). 4. Atmospheric & Diegetic Lighting: The scene must be lit ONLY by the lights within the cities themselves. Left Atmosphere: Simulate the weather and lighting color temperature of City A (e.g., London = Foggy/Cool Blue; Miami = Sunny/Warm Orange). Right Atmosphere: Simulate the weather and lighting color temperature of City B. Contrast: The visual clash comes from the bleeding together of these two lighting environments at the Fault Line. 5. Visual Syntax: Camera: Isometric Orthographic projection, 45-degree high angle. Focus: Tilt-shift effect, blurring the bottom of the bedrock and the deep background void, keeping the rooftops sharp. Render Style: Hyper-realistic miniature model photography, "Kitbash" aesthetic, Octane Render. Output: 8k resolution, highly detailed textures, volumetric lighting, photorealistic.

A cozy dreamy high-fashion editorial portrait. A person fully surrounded by dozens of ultra-fluffy kittens of different breeds, filling the entire frame like a soft living cloud. Extremely long, dense, airy fur, plush texture, maximum volume, soft halos around each kitten. A mix of British Shorthair, Scottish Fold, Maine Coon kittens, Ragdoll, Persian and Norwegian Forest kittens. All kittens are small, clean, healthy, baby-like proportions, big eyes, round faces, in white, cream, beige, gray and soft ginger tones, with hyper-realistic fur details. The person is lying calmly in the center, as if resting near a fireplace, eyes gently closed or with a soft peaceful smile, deep cozy holiday mood. Wearing a warm red New Year-inspired cozy sweater: oversized chunky knit, soft wool or cashmere texture, thick yarn, rich deep red color, relaxed fit, elegant and festive, hygge aesthetic, no logos. Warm fireplace lighting, glowing amber light, gentle shadows, cinematic warmth, soft fur highlights. lights. Delicate Christmas garlands with warm fairy lights softly surrounding the scene, creamy bokeh lights in the background, magical winter atmosphere. Top-down composition, symmetrical, cinematic fashion photography, Vogue editorial style, ultra-detailed, photorealistic, shallow depth of field, 85mm lens look, high resolution. Face preservation: keep facial structure from reference photo natural skin texture realistic proportions Quality: ultra high quality, editorial photography, soft focus, luxury aesthetic, ultra-cozy surreal winter mood. Christmas warmth, bygge, softness overload

{ "type": "image_generation_prompt", "identity_preservation": { "use_reference_image": true, "alter_face": false, "notes": "Preserve the woman’s exact facial identity, structure, and natural features from the uploaded image." }, "subject": { "gender": "female", "pose": { "position": "sitting confidently in the center of the room", "posture": "leaning slightly forward", "hands": "resting on her knees", "expression": "confident and mysterious, looking directly at the camera" }, "wardrobe": { "top": "black oversized T-shirt or hoodie", "bottoms": "black trousers", "footwear": "black sneakers", "accessories": "black cap" } }, "environment": { "setting": "dramatic red-lit room", "decor": { "floor": "stacks of cash piled on the ground", "tables": "metal or wooden tables covered with scattered money", "props": [ "cash boxes", "open briefcases filled with money" ] }, "atmosphere": "intense, secretive, cinematic — evoking a high-stakes, confidential environment" }, "lighting": { "primary_tone": "deep red dominating the room", "secondary_tone": "subtle cyan reflections on the cap and cash highlights", "style": "moody and cinematic with strong red–cyan contrast", "aesthetic": "premium tech-crime thriller look" }, "camera_style": { "resolution": "ultra-high-resolution", "composition": "cinematic centered framing", "focus": "sharp on subject, softer background depth", "mood": "bold and mysterious" }, "rendering": { "realism": "photorealistic", "detail_level": "ultra-detailed cash, clothing textures, lighting reflections", "restrictions": [ "no text", "no digital overlays" ] }, "output_goal": "Create a cinematic, ultra-high-resolution portrait of a woman in an intense red-lit room full of cash, preserving her exact facial identity and delivering a bold, mysterious, high-stakes thriller aesthetic." }