logo
#

Latest news with #GoogleGeminiLive

I use Gemini Live every day — here's the 7 craziest things I've found it can do for you
I use Gemini Live every day — here's the 7 craziest things I've found it can do for you

Tom's Guide

time02-07-2025

  • Tom's Guide

I use Gemini Live every day — here's the 7 craziest things I've found it can do for you

Of the recent advancements in artificial intelligence, Google Gemini Live might rank as one of the most innovative. Using nothing but your phone, you can engage with the world in countless ways. Similar to how Google Lens allows us to see a translation in real-time, Gemini Live opens up a world of possibility — to learn about bike maintenance, ask about the clouds in the sky and if it might rain, and even cheat at board games. Here are the craziest and most impressive things you can do with Gemini Live, along with some tips on getting the most out of the bot during these interactions. We'll call this one cheating, although it's one of the coolest things you can do with Gemini Live. When you point your camera at your own tiles in Scrabble, for example, the bot will suggest words you can use. In one test shown in the screenshot, Gemini suggested using the word 'deck' for 24 points. The caveat here is that Gemini Live struggles a bit when it comes to valid words — in my tests, the words were a little unusual and in some cases not in the official Scrabble dictionary. I wanted the bot to suggest words that had higher scores, too. You might try some coaching, asking for more common words or to use the double-word score. In the modern connected home, we have gadgets for just about everything, but even adjusting the temperature can be a challenge. Get instant access to breaking news, the hottest reviews, great deals and helpful tips. With Gemini Live, you can point your camera at a thermostat and ask how to adjust the settings. Sometimes, Gemini might need you to focus on the name of the device, but in most cases, it knows the product just by the shape and color. I even tried asking Gemini about something called a Dehumidistat. Gemini explained that you can set the humidity level to the desired level — the Dehumidistat controls the air ventilation system in my home. The bot told me I can use a weather app to check the current humidity level, and then set the humidity to a lower level — such as 50% — to keep things less humid. Gemini can also give you more complex instructions in real-time. Let's say the chain on a bike is a bit rusty (see screenshot). Gemini can give you some tips on applying some lubricant to the chain, but you can go much further than that. If your chain is broken, Gemini can provide detailed tips about how to connect the 'rivets' (the small connectors on the chain) which usually requires something called a Master Link Pliers. Interestingly, you can also ask Gemini which tool to use to repair the rivets and then even where to buy a product like that. This one is really useful at the library or a bookstore. You can pull out your phone, fire up Gemini Live, and point your camera at any book cover. Then, ask the bot to summarize the book. I tested this with even a few brand new books including one called Every Living Thing (about the history of naming plants and animals). Gemini gave a general summary about the book, but then I asked for more details and if the book was worth purchasing. You can even converse about the book, asking about the author, where to buy it, and about similar books. This one was quite fascinating to me. I asked Gemini about the chemical compound of the liquid in my cup. Gemini first noted that my cup had a baby photo and contained coffee; both were correct. But then I asked about the chemical compound of coffee. I expected to hear mostly about caffeine, but the bot proceeded to explain how 'melanoidins' form during roasting to give coffee a dark tone and how 'trigonelline' is an alkaloid that creates the distinct aroma. I did the same with milk and a cup of root beer — the answers were more scientific than I expected. This one is a great party trick because Gemini can read just about anything. I asked the bot to read a handwritten note from one of my grandkids and listened as Gemini read the whole thing. But it started to get even more interesting. Taking a journal which had complex notes about a gadget I'm testing — including the size, weight, and other specs — Gemini read everything perfectly and even guessed which gadget it was (correctly). Then, I tried writing some of this article by hand in the journal and asked Gemini to read portions aloud to see if the writing was clear. It turns out Gemini Live is a great editor and proofreader. I had fun with this one. Gemini Live can tell you which clouds are in the sky based on their shape. Pointing the camera out of my office window, Gemini correctly explained that the clouds were cumulus and that the weather looked 'fair' without a chance of rain. It turns out that Gemini is not a bad meteorologist. Weather forecasters know that cumulus cloud rarely contain rain and are a prime indication that it's likely going to remain at least partly sunny.

ChatGPT Record quietly rolled out for Pro users — here's why I think free accounts could get voice messages soon
ChatGPT Record quietly rolled out for Pro users — here's why I think free accounts could get voice messages soon

Tom's Guide

time20-06-2025

  • Tom's Guide

ChatGPT Record quietly rolled out for Pro users — here's why I think free accounts could get voice messages soon

OpenAI has quietly rolled out a new 'Record' mode for ChatGPT — but for now, it's limited to Pro, Enterprise, and Edu users on the macOS desktop app. The new feature lets you tap a microphone icon and record a short voice message instead of typing. It's a small change, but one that enables ChatGPT to support users as a true voice assistant, making it faster and more conversational than ever. With Record mode, users simply speak their question, and ChatGPT will generate a response based on their audio input. A quick transcription of what you said also appears on screen, keeping the interaction clear and easy to follow. This is a briefer experience, not a full voice conversation like ChatGPT Voice, but it is designed for quick queries on the desktop. Currently limited to Mac users on the ChatGPT app — and only those on a Pro paid plan. Get instant access to breaking news, the hottest reviews, great deals and helpful tips. But I wouldn't be surprised to see it expand to mobile apps and free accounts in the near future. Why? It fits perfectly into OpenAI's larger strategy of making ChatGPT more multimodal; now it further combines voice, vision and text in one experience to support a seamless AI assistance. And let's not forget the competition: Google Gemini Live already supports real-time voice interaction across Android and iOS. If OpenAI wants ChatGPT to match that level of usability — especially on mobile — bringing Record mode to more users makes a lot of sense. Since OpenAI launched ChatGPT Voice for mobile, which I have tested and found incredibly useful, the next logical step is Record mode. However, ChatGPT Record is different, similar to AI transcription apps. It's fast, lightweight, and well-suited for quick interactions when you don't want a full back-and-forth conversation. With multimodal capabilities now a key battleground in the AI assistant space, I'd expect OpenAI to continue expanding features like Record. Giving free-tier users a taste of these tools helps build loyalty and could drive upgrades to paid plans. For now, if you're using ChatGPT Pro, Enterprise, or Edu on a Mac, look for the new mic icon next to the chat box to try Record mode. If you're not on a paid plan, keep an eye out; this is one feature that could be making its way to the broader ChatGPT user base sooner than we may think.

DOWNLOAD THE APP

Get Started Now: Download the App

Ready to dive into a world of global content with local flavor? Download Daily8 app today from your preferred app store and start exploring.
app-storeplay-store