My current workplace (and I love being back at work, I love where I work, and I love my new job) blocks AI stes. In a way, I get it; they pay for my work, and not that of an AI.
(Gemini) I have had trouble getting decent site/architectural renderings out of it. I find that if it makes a wrong turn with a rendering, it's really hard to get it back on track and I get frustrated and have to start over. Example would be I get an image with the building I want but the entryway is the wrong style and no matter what I ask it to do from there on out it just. won't. change. the. entryway.
Precise architectural rendering is a weak spot of most AIs, including image-specific AIs like Midjourney. Still, for my purposes of cleaning up old or rough drawings or renderings without the AI getting "creative" and changing things. Gemini is the best I've found, Here's a few examples from the Unbuilt Buffalo project.
Adelphia Tower (late 1990s): before
After (Gemini; "Clean up and colorize this architectural drawing of the unbuilt Adelphia Tower, proposed for downtown Buffalo in 1997. Don't use dark lines between different colors, shades, or shadow lines.):
ChatGPT, using the same prompt. Not bad!
Compare that with Claude, using the same prompt. It admits failure. "Given it started as a grainy newspaper scan, this is about as clean as pixel-level processing can get without AI upscaling or manual retouching."
Grok, same prompt. I like Grok for text-base actions, but it didn't do that great with this.
Mistral, same prompt. Not too different than Grok.
Midjourney, same prompt. WTF?
Peace Bridge proposal (1913). Before:
After (Gemini):
Ambassador Hotel (1921): before
After (Gemini):
If I want Gemini to get creative, it obliges.
Union Station (1907): before.
After (Gemini).