Google just dropped some impressive updates to its Gemini AI model, and they’re rolling out across mobile and desktop. But what really caught my attention is the new live video and screen-sharing capabilities. Yes, Gemini can now watch your camera feed in real-time. And if that doesn’t make you pause for a second and think about the implications for productivity, learning, accessibility, and yes, privacy, I don’t know what will.
Here’s what’s happening:
Google announced a suite of upgrades to Gemini and its underlying tech, including a live agent called Astra, which will become part of our daily lives sooner than we think. The company’s head of DeepMind, Demis Hassabis, demoed a version of Gemini that could watch live through a phone’s camera, analyze what it sees, and respond conversationally. The model even remembered where things were placed in a scene after they disappeared.
Imagine asking your phone: “Where did I leave my keys?” and having it actually know the answer because it saw you put them down earlier. That’s not science fiction anymore.
Another standout feature? Gemini can now see your screen in real-time and help you navigate apps, complete forms, or even troubleshoot issues. Picture this: you’re setting up a new CRM or trying to upload content to a new platform, and instead of Googling your way through forums, Gemini just walks you through it like a hyper-intelligent assistant.
When will these updates be accessible?
The updates are rolling out to the Gemini mobile app, as well as through the side panel in Gmail, Docs, Sheets, Slides, and Drive. For teams already using Google Workspace, this could be a game changer.
Why this matters for marketers
Here’s why I’m excited—and why this matters for marketers, PR pros, educators, and communicators:
- Real-time collaboration and guidance: With Gemini watching your screen or camera feed, training sessions, onboarding, and live problem-solving are about to get a serious AI upgrade.
- Accessibility and support: Think about users who need extra assistance navigating tech. Now they’ve got a tool that doesn’t just respond to commands but understands visual context.
- Next-level personalization: Gemini isn’t just generating text anymore—it’s observing, remembering, and responding with full situational awareness.
Impact on privacy and transparency
Of course, with this kind of tech comes a new wave of responsibility. Transparency, privacy protections, and clear guidelines on how visual data is stored and used will be essential.
But if implemented thoughtfully, this is the kind of generative AI that shifts us from asking AI for help to working alongside it.
We’re entering the age of truly multimodal AI—where models can listen, look, respond, and remember. And for those of us helping teams, brands, and students integrate AI into their workflows, that opens up an entirely new set of tools and strategies.
If you thought prompt engineering was powerful before, just wait until your AI assistant can see what you see.
Let’s keep watching this space—it’s moving fast.
Remember, AI won’t take your job. Someone who knows how to use AI will. Upskilling your team today, ensures success tomorrow. In-person and virtual training workshops are available. Or, schedule a session for a comprehensive AI Transformation strategic roadmap to ensure your marketing team utilizes the right AI tech stack for your needs.
Navigating AI Risks: Protect Your Brand’s Voice
Your brand voice can now be replicated, reshaped, and misrepresented by AI. Learn why it has become a legal asset and how communications teams must adapt to protect and control their narrative.
AI Doesn’t Create Chaos. It Reveals It
The first article in the Human-Led AI Adoption series explains why AI exposes workflow gaps and how organizations build governance, clarity, and scalable integration.
Paid Media Is Coming to AI Conversations (Yes, Even the Personal Ones)
Paid and sponsored content in AI models is here. Small test are proving valuable as brands try to connect authentically without intrusion.

