Interesting
  • William
  • Blog
  • 8 minutes to read

I Tried Gemini For My Email. Here’s Why I Don’t Trust It

Over the course of the past three months, I have experimented with three AI stacks, each priced at $20 per month for the base tier. I started with ChatGPT Plus, then moved to Perplexity Pro, and finally bought into the promise of Gemini Advanced. I stuck with the latter for the longest spell, primarily owing to its deep integration with other Google products such as Gmail and Docs, which are an integral part of my workflow.

Now, my journey with Gemini hasn’t exactly been glorious, and like a majority of rival generative AI products, it has run into its fair share of hiccups. The inaccuracy woes got so bad that Google chief Sundar Pichai had to apologize for them. Researchers have also independently showcased how it can be manipulated to generate misleading content, spill sensitive data, and even go ahead with malicious tasks. Let’s, for a moment, assume that those are high-level hiccups and that an average user likely won’t run into such problems.

Google is pushing Gemini as a more capable alternative to Google Assistant. The reality, however, is different. I pushed Gemini predominantly for basic tasks, like speeding up my inbox duties, handling my calendar schedule, and just keeping an eye on my Workspace activities. However, the pace at which it served downright misleading information — which merely required a look at my own data instead of an expansive web search — made me question Gemini’s reliability and whether it can be trusted with anything beyond its usual chatbot duties.

Stumbling in its own backyard

Gmail was my first test bed of experimenting with Gemini, hoping to integrate it with my workflow. However, what I found was that it can blatantly lie. As can be seen in the image above, I asked Gemini about the status of my most recent FedEx package. It pulled up information about shipments from last year, but couldn’t pick up a single detail from over two dozen emails from FedEx in my inbox, all of which arrived within a span of one week.

The information it served in the chat box, though outdated, was not inaccurate, down to the tracking number. Where it missed the mark was confidently telling me that the “latest update for your FedEx package” was custom clearance roughly three months ago, and not a series of fresh updates that arrived merely three minutes ago, with a frequency of at least three emails daily, dating back to at least a week.

Likewise, I asked Gemini about “the most recent Calendar entry.” Instead of telling me about the three meetings I closed in the second week of January, it simply replied with “I don’t see any events on your calendar.” This is not only a contextually inaccurate reply, but also downright wrong because instead of looking at “recent” events, Gemini tried to find events in the future.

My Calendar entries are tied inherently to my inbox. I send and receive Google Meet invites directly via my inbox dashboard. It’s just surprising that despite appearing as a standalone tool prominently in the mobile and desktop versions of Gemini, the AI can flub at something as basic as checking for events, and instead going in the opposite direction and reversing the query’s context.

Will it? Won’t it?

Gemini integrates with other Google Workspace apps (and their data) via a system of extensions. A similar pipeline is also in place for crosstalk with other apps, such as WhatsApp. Yet, the experience leaves a lot of room for improvement. Actually, scratch that. Gemini can be infuriatingly dumb at occasions, despite Google touting its natural language chops as one of the best out there.

Let’s start at the most basic level. Just like chat apps, where you can use the “@” shortcut for addressing a person or group, Gemini also relies on the same shortcut for selecting the right Workspace platform to get the job done. Well, it doesn’t seem to work even for the most basic queries. Moreover, comprehension disparities across different platforms only make matters worse.

I summoned Gemini and told it to send a “hi” to my sister. I even used the “@” shortcut and picked Gmail as the destination to execute the task. The AI assistant simply refused to do so in the Gemini app for iPhones, even though it worked just fine moments ago, pulling up minute details from a long Gmail chain about a research paper. In the Android app, Gemini simply couldn’t decide what to do.

Within a span of a few minutes, its responses made a 180-degree flip. In the first attempt, it asked me which “Saba” from my inbox was I referring to, before the message could be sent. On the second try, using the exact same prompt, Gemini outright rejected the chore, citing an inability to do so. Also, the extra information it provided about communication history was downright false.

Failing even at the basics

At the moment, trying to get even the simplest tasks done with Gemini is like playing an AI whack-a-mole, one where you also have to pore over support pages to check whether Gemini will work on your phone. It’s quite vexing that, despite paying $20 a month, Gemini isn’t capable of doing a task as basic as making a call. On an Android phone, I was able to make a call with a simple “Call XYZ @Phone” command.

On iOS, the Gemini app doesn’t support the “@Phone” extension that would allow it to make a call. Alright, let us, for a moment, assume that Apple will never allow an AI access to the Phone app, owing to privacy and security reasons. Also, Apple already has a new avatar of Siri ready, one that is juiced up on OpenAI’s tech stack, so it makes sense to keep such fundamental capabilities locked to its own assistant.

But what about third-party communication platforms like WhatsApp? Well, the ability to send a text on WhatsApp is limited to Android, where you can freely use the “@Whatsapp” shortcut in the chatbot. On the iOS app, you don’t have that luxury. Heck, even the “@Gmail” extension returns an “I can’t assist you with that” response.

These are no small failures. If Google’s assistant can’t even pull off a task as simple as sending an email and runs into platform gates even with third-party apps, there is little point in paying $20 for Gemini on the hollow promises of seamless Workspace access and collaboration.


Source: http://www.slashgear.com/1762021/google-gemini-email-draft-summary-feature-tested/

Inline Feedbacks
View all comments
guest

How Do ‘AI Productivity’ Apps Like Beloga Actually Work?

While general purpose chatbots like OpenAI's ChatGPT are the focus of initial AI consumer hype, AI products with...

AI Governance in the Age of Uncertainty: Building Regulatory Frameworks for Unknown Futures

The emergence of artificial intelligence as a transformative force in human society has created an unprecedented regulatory paradox....

What Is Agentic AI & How Might It Change How The World Works In The Future?

If films and TV have taught us anything, it's that the future ought to be full of autonomous,...

What Is The Stargate Project? The United States’ $500 Billion AI Venture, Explained

President Donald Trump has described the launch of the Stargate Project as a "monumental undertaking" and "a resounding declaration...

Is The 2026 Ford Mustang 288 V8 Real?

While the age of the internet has largely changed the world for the better, that progress has, understandably,...

Just How Much Energy Does Generating An AI Image Actually Use?

Image generation with the use of artificial intelligence has become commonplace online, with plenty of buzz surrounding the...

Do AI Humanizers Actually Work? We Tested Them And This Is What We Found

First, we had ChatGPT and other Generative Pre-Trained Transformers, which created AI-generated text. Next, we had AI Detectors,...

The Controversy Of Virtual Influencers And How They’re Taking Over Social Media

AI has made it dramatically easier to make artificial "personalities" within a matter of minutes. A few natural...

Machine Learning Transparency: Making AI Understandable for Business Success

The proliferation of machine learning systems across industries has created an unprecedented challenge for business leaders: how to...

AI-Generated Images Are About To Invade Your iPhone, iPad, And Mac

Apple recently announced a lot of new AI-powered software features that will soon be integrated into the iOS18...

Saying These Simple Words To ChatGPT Is Costing OpenAI Millions Of Dollars

Growing up, we were all taught to be polite, but when you're one of the world's foremost AI...

We Tried Apple Intelligence. Here Are The Nine Best Features So Far

Apple's foray into generative AI began with the introduction of iOS 18 at WWDC 2024. In usual fashion,...

Brisk AI: How Does It Work, Who Is It For, & What Data Does It Collect?

Artificial Intelligence is quickly becoming rather synonymous with convenience and efficiency. Gone are the days when tasks like...

The Formerly Futuristic Way Amazon Scans For Product Defects Before Shipping

We may receive a commission on purchases made from links. When you order a product from Amazon, you...

Why Some May Not Trust Using Gemini In Their Google Workspace Account

As it competes against other companies in the AI race, Google is pushing its Gemini AI into every...

Exploring Advertising Options in Telegram Mini Apps

Telegram Mini Apps present a fresh approach to digital advertising through specialized formats designed for the platform's ecosystem....

Is Gemini Advanced Really Worth Paying For?

Google is charging ahead in the AI race, putting the full weight of its influence behind its Gemini...

4 Ways NASA Is Using AI For Space Exploration

The buzz and controversy surrounding artificial intelligence may make it seem pretty new, but NASA has been using...

Celebrity Voices Like John Cena And Awkwafina Headline Meta’s Latest AI Upgrades

As part of the Meta Connect keynote, the technological giant has unveiled a variety of new developments in...

Elon Musk-Led Group Might Buy OpenAI For $97B, Crushing Altman’s Plans For ChatGPT

The saga of Elon Musk and OpenAI — the AI giant he helped start a decade ago and...