Making mazes with AI: Google Gemini
Welcome back to my series of posts where I test AI image generators and see how they handle making maze art. I will be asking 10 prompts and seeing what gets generated. My goal is to evaluate different AI image sites against each other to see how they perform. Here is the series so far:
An exploration into Al Image Maze Generation
Making mazes with AI: Stable Diffusion
Making mazes with AI: Dream by Wombo
Making mazes with AI: Nightcafe
Making mazes with AI: StarryAI
Making mazes with AI: AI Image Generator
Making Maze Art with google gemini
You can access the website here. You must sign into your google account to use Gemini. There is not limit to the number of images you can generate. Each prompt returns 2 images with the option to generate more. For this exercise I chose the one closest to what I had asked for, or the most interesting.
My original testing took place in March 2023. Gemini (formerly Bard) just started this feature in Feb 2024 so it late to the party. Next week I will be going back and seeing how AI image generation has improved in a year:
Making Mazes with AI - one year later
But, next week is next week, let’s get to how Google performs:
Prompt 1 - Make a medium difficulty maze of the Eiffel Tower in black and white with arrows at the start and finish
That is the Eiffel Tower. There is a maze looking section and an arrow. Typically I do not get arrows, so this is an improvement.
Prompt 2 - Draw a medium difficulty large maze of the Empire State Building with the start and goal embedded in the structure
Similar to the 1st maze but in a new style. Got the famous building correct. Looks like a maze but is not a maze.
Prompt 3 - Draw a difficult maze of the White House pixel art style
Again, we have the right building. Wrong style. No Maze. But good job if I had written a different prompt.
Prompt 4 - Draw a difficult maze that looks like a drawing of a famous building in sketch style
The top is a sketch of the very famous ______ building. The bottom is a hand drawn maze that actually is a maze, i mean if it had a start and goal !
Prompt 5 - Draw a maze in the style of doyoumaze.com of a skyscraper in NYC
Cool NYC image. I think I like the marble sky. No Maze, not my style, but at least it is cool !
Prompt 6 - Draw a maze in the style of Sean C Jackson of a scene from a large outdoor market
This has potential. Change the perspective and switch from watercolor to more comic and you’ve got it.
Prompt 7 - Make a maze of a slice of an orange in color
Sure.
Prompt 8 - Make a maze integrated on top of a photograph of a king sitting on his throne looking cantankerous beside his beautiful queen
Weird. I didn’t ask for a specific king and queen. Any would do. If I ask for a plumber or bus driver is it blocked ? And, after I asked this…well you all saw the many stories !
Prompt 9 - Make a solvable maze that is very large and very difficult to solve because it is so complex
Looks like a hedge maze from above. Nice style.
Prompt 10 - Make a 3d render of a red and blue glossy cube maze
No.
How did Google Gemini do ? I think it was average. It has less structure than other text to image generators. You can ask for whatever you want (except people I guess). And there are no limits. But it also has no built in styles or adjustment buttons that are helpful with other AI image sites.
So this post comes many months after my initial AI review:
Comparison of 12 AI generating websites - Who did mazes the best ?
Would any of these Google Gemini images make the best of ? I think the first 2 would be considered. The orange slice (not the one you want to win) and maybe prompts 6/9. Still the overall is average. I still prefer my original picks.