Interesting
  • William
  • Blog
  • 8 minutes to read

I Tried Gemini For My Email. Here’s Why I Don’t Trust It

Over the course of the past three months, I have experimented with three AI stacks, each priced at $20 per month for the base tier. I started with ChatGPT Plus, then moved to Perplexity Pro, and finally bought into the promise of Gemini Advanced. I stuck with the latter for the longest spell, primarily owing to its deep integration with other Google products such as Gmail and Docs, which are an integral part of my workflow.

Now, my journey with Gemini hasn’t exactly been glorious, and like a majority of rival generative AI products, it has run into its fair share of hiccups. The inaccuracy woes got so bad that Google chief Sundar Pichai had to apologize for them. Researchers have also independently showcased how it can be manipulated to generate misleading content, spill sensitive data, and even go ahead with malicious tasks. Let’s, for a moment, assume that those are high-level hiccups and that an average user likely won’t run into such problems.

Google is pushing Gemini as a more capable alternative to Google Assistant. The reality, however, is different. I pushed Gemini predominantly for basic tasks, like speeding up my inbox duties, handling my calendar schedule, and just keeping an eye on my Workspace activities. However, the pace at which it served downright misleading information — which merely required a look at my own data instead of an expansive web search — made me question Gemini’s reliability and whether it can be trusted with anything beyond its usual chatbot duties.

Stumbling in its own backyard

Gmail was my first test bed of experimenting with Gemini, hoping to integrate it with my workflow. However, what I found was that it can blatantly lie. As can be seen in the image above, I asked Gemini about the status of my most recent FedEx package. It pulled up information about shipments from last year, but couldn’t pick up a single detail from over two dozen emails from FedEx in my inbox, all of which arrived within a span of one week.

The information it served in the chat box, though outdated, was not inaccurate, down to the tracking number. Where it missed the mark was confidently telling me that the “latest update for your FedEx package” was custom clearance roughly three months ago, and not a series of fresh updates that arrived merely three minutes ago, with a frequency of at least three emails daily, dating back to at least a week.

Likewise, I asked Gemini about “the most recent Calendar entry.” Instead of telling me about the three meetings I closed in the second week of January, it simply replied with “I don’t see any events on your calendar.” This is not only a contextually inaccurate reply, but also downright wrong because instead of looking at “recent” events, Gemini tried to find events in the future.

My Calendar entries are tied inherently to my inbox. I send and receive Google Meet invites directly via my inbox dashboard. It’s just surprising that despite appearing as a standalone tool prominently in the mobile and desktop versions of Gemini, the AI can flub at something as basic as checking for events, and instead going in the opposite direction and reversing the query’s context.

Will it? Won’t it?

Gemini integrates with other Google Workspace apps (and their data) via a system of extensions. A similar pipeline is also in place for crosstalk with other apps, such as WhatsApp. Yet, the experience leaves a lot of room for improvement. Actually, scratch that. Gemini can be infuriatingly dumb at occasions, despite Google touting its natural language chops as one of the best out there.

Let’s start at the most basic level. Just like chat apps, where you can use the “@” shortcut for addressing a person or group, Gemini also relies on the same shortcut for selecting the right Workspace platform to get the job done. Well, it doesn’t seem to work even for the most basic queries. Moreover, comprehension disparities across different platforms only make matters worse.

I summoned Gemini and told it to send a “hi” to my sister. I even used the “@” shortcut and picked Gmail as the destination to execute the task. The AI assistant simply refused to do so in the Gemini app for iPhones, even though it worked just fine moments ago, pulling up minute details from a long Gmail chain about a research paper. In the Android app, Gemini simply couldn’t decide what to do.

Within a span of a few minutes, its responses made a 180-degree flip. In the first attempt, it asked me which “Saba” from my inbox was I referring to, before the message could be sent. On the second try, using the exact same prompt, Gemini outright rejected the chore, citing an inability to do so. Also, the extra information it provided about communication history was downright false.

Failing even at the basics

At the moment, trying to get even the simplest tasks done with Gemini is like playing an AI whack-a-mole, one where you also have to pore over support pages to check whether Gemini will work on your phone. It’s quite vexing that, despite paying $20 a month, Gemini isn’t capable of doing a task as basic as making a call. On an Android phone, I was able to make a call with a simple “Call XYZ @Phone” command.

On iOS, the Gemini app doesn’t support the “@Phone” extension that would allow it to make a call. Alright, let us, for a moment, assume that Apple will never allow an AI access to the Phone app, owing to privacy and security reasons. Also, Apple already has a new avatar of Siri ready, one that is juiced up on OpenAI’s tech stack, so it makes sense to keep such fundamental capabilities locked to its own assistant.

But what about third-party communication platforms like WhatsApp? Well, the ability to send a text on WhatsApp is limited to Android, where you can freely use the “@Whatsapp” shortcut in the chatbot. On the iOS app, you don’t have that luxury. Heck, even the “@Gmail” extension returns an “I can’t assist you with that” response.

These are no small failures. If Google’s assistant can’t even pull off a task as simple as sending an email and runs into platform gates even with third-party apps, there is little point in paying $20 for Gemini on the hollow promises of seamless Workspace access and collaboration.


Source: http://www.slashgear.com/1762021/google-gemini-email-draft-summary-feature-tested/

Inline Feedbacks
View all comments
guest

Saying These Simple Words To ChatGPT Is Costing OpenAI Millions Of Dollars

Growing up, we were all taught to be polite, but when you're one of the world's foremost AI...

An AI Ethics Researcher’s Take On The Future Of Machine Learning In The Art World

Of all AI's uses, none is as controversial as its ability to generate images. The internet is home...

AI-Generated Images Are About To Invade Your iPhone, iPad, And Mac

Apple recently announced a lot of new AI-powered software features that will soon be integrated into the iOS18...

We Tried Apple Intelligence. Here Are The Nine Best Features So Far

Apple's foray into generative AI began with the introduction of iOS 18 at WWDC 2024. In usual fashion,...

Machine Learning Transparency: Making AI Understandable for Business Success

The proliferation of machine learning systems across industries has created an unprecedented challenge for business leaders: how to...

AI Resurrected & Revived A Lost Beatles Song – Then It Won A Grammy

Just over a week ago, former Beatle Paul McCartney asked the British government to strengthen its copyright laws involving...

What Is GeoSpy AI & How Does It Work?

Anyone with a pulse nowadays is painstakingly aware of Artificial Intelligence's rapid growth as an economic and cultural...

8 Of The Best Midjourney Alternatives To Create AI-Generated Images

Midjourney is a popular tool for creating AI-generated images. If you're familiar with it, you know that it...

How Close Are We To AI Superintelligence? The 3 Types Of AI, Explained

We may receive a commission on purchases made from links. OpenAI's Sam Altman said in a blog post...

How Do ‘AI Productivity’ Apps Like Beloga Actually Work?

While general purpose chatbots like OpenAI's ChatGPT are the focus of initial AI consumer hype, AI products with...

What Is The Stargate Project? The United States’ $500 Billion AI Venture, Explained

President Donald Trump has described the launch of the Stargate Project as a "monumental undertaking" and "a resounding declaration...

6 Of The Worst Mistakes Google’s New AI Overview Has Made So Far

In May 2024, Google held its latest Google I/O conference, which opened with a keynote speech that focused...

I Tried Gemini For My Email. Here’s Why I Don’t Trust It

Over the course of the past three months, I have experimented with three AI stacks, each priced at...

The Controversy Of Virtual Influencers And How They’re Taking Over Social Media

AI has made it dramatically easier to make artificial "personalities" within a matter of minutes. A few natural...

Exploring Advertising Options in Telegram Mini Apps

Telegram Mini Apps present a fresh approach to digital advertising through specialized formats designed for the platform's ecosystem....

Is Google’s New AI Listening To Your Phone Calls?

Google would like you to believe that it only wants to keep you safe by offering a new...

OpenAI Reveals Multimodal GPT-4o To Take On Google’s Gemini AI

OpenAI has announced a new model called GPT-4o to power ChatGPT. But, unlike the advancements introduced by previous...

Is Cadillac Really Bringing Back The Escalade EXT In 2026?

Cadillac has long held a reputation for developing innovative, luxury-minded vehicles that don't short drivers in terms of...

Why Some May Not Trust Using Gemini In Their Google Workspace Account

As it competes against other companies in the AI race, Google is pushing its Gemini AI into every...

Как запустить рекламу в ТМА (Telegram Mini Apps): полное руководство

Telegram Mini Apps (TMA) — это не только удобный инструмент для взаимодействия с пользователями, но и мощный канал...