Chipp.in Tech News and Reviews

Windows, Security & Privacy, Open Source and more


Gmail launches improved text classifier to combat spam

Posted on December 5, 2023 by Martin Brinkmann

Email spam is still a major problem on today’s Internet. Most users who have email accounts receive spam regularly. While mail filters at most providers detect the bulk of it, enough spam still slips through the cracks.

Google launched a new text classifier on Gmail that promises better detection rates, fewer false positives and improved performance. Called RETVec (Resilient & Efficient Text Vectorizer), it improves spam detection on Gmail by 38% and reduces false positives by almost 20%.

Google says that RETVec achieves this by “combining a novel, highly-compact character encoder, an augmentation-driven training regime, and the use of metric learning”.

Its architecture makes RETVec compatible with every language and all UTF-8 characters out of the box, without the need for text preprocessing.
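One way to see why a character-level encoder needs no per-language vocabulary: every character has a Unicode code point, and a code point can be written as a fixed-width bit vector. The sketch below illustrates that idea only. It is not RETVec's encoder, and the function names and the 24-bit width are assumptions chosen for the example.

```python
# Toy illustration: turn any character into a fixed-width bit vector
# derived from its Unicode code point. Hypothetical helper names; this
# is NOT RETVec's actual implementation.

def encode_char(ch: str, bits: int = 24) -> list:
    """Return the code point of `ch` as a list of bits, least significant first."""
    code_point = ord(ch)
    if code_point >= 1 << bits:
        raise ValueError("code point does not fit into the chosen width")
    return [(code_point >> i) & 1 for i in range(bits)]


def decode_char(bit_vector: list) -> str:
    """Inverse of encode_char: rebuild the character from its bits."""
    return chr(sum(bit << i for i, bit in enumerate(bit_vector)))


def encode_word(word: str, bits: int = 24) -> list:
    """Encode each character of `word` independently."""
    return [encode_char(ch, bits) for ch in word]


# The same function handles Latin, accented, CJK and emoji characters,
# because all of them are just code points.
for ch in "Aé日😀":
    assert decode_char(encode_char(ch)) == ch
```

Because the highest Unicode code point (U+10FFFF) fits into 21 bits, a 24-bit vector covers every character without a lookup table.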

Spammers and malicious actors use different methods to bypass spam filters. Frequent methods include the use of homoglyphs, characters that look very much alike, or the use of invisible characters.
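To make these two tricks concrete, here is a minimal sketch of how they defeat a naive keyword filter. The blocked phrase, the sample messages and the helper names are made up for illustration; real filters are far more sophisticated.

```python
import unicodedata

# Hypothetical blocked phrase, used only for this illustration.
BLOCKED = "free prize"

# A small, non-exhaustive set of zero-width (invisible) characters.
INVISIBLE = {"\u200b", "\u200c", "\u200d", "\ufeff"}


def naive_match(text: str) -> bool:
    """Plain substring check, as a simplistic filter might do."""
    return BLOCKED in text.lower()


def strip_invisible(text: str) -> str:
    """Remove the zero-width characters listed above."""
    return "".join(ch for ch in text if ch not in INVISIBLE)


plain = "Claim your free prize now"
# Cyrillic 'е' (U+0435) replaces the first Latin 'e' in "free".
homoglyph = "Claim your fr\u0435e prize now"
# A zero-width space (U+200B) is inserted inside "free".
invisible = "Claim your fr\u200bee prize now"

print(naive_match(plain))                       # True
print(naive_match(homoglyph))                   # False: looks identical, different code point
print(naive_match(invisible))                   # False: hidden character breaks the match
print(naive_match(strip_invisible(invisible)))  # True once the hidden character is removed
print(unicodedata.name(homoglyph[13]))          # CYRILLIC SMALL LETTER IE
```

Stripping invisible characters helps in the second case, but it does nothing against the homoglyph, which is why hand-written cleanup rules tend to lag behind new evasion tricks.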

Google claims that Gmail’s new anti-spam system is better suited to identify these tactics and deal with them accordingly.

The company tested the new model internally at Google for a time to better understand its effectiveness. As a result of these internal tests, Google says it found RETVec “highly effective for security and anti-abuse applications”.

RETVec in detail


RETVec is released as open source. You may visit the GitHub project website for access to the source. There, you will also find more information, including the paper and links to demos.

Google describes RETVec in the following way on GitHub to a development-focused audience:

RETVec is trained to be resilient against character-level manipulations including insertion, deletion, typos, homoglyphs, LEET substitution, and more. The RETVec model is trained on top of a novel character encoder which can encode all UTF-8 characters and words efficiently.
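The manipulations named in the quote are easy to picture with a few lines of code. The following is a rough sketch of insertion, deletion, a swap-style typo and LEET substitution applied to a single word. The function names and the substitution table are assumptions made for this example and are not taken from the RETVec code base.

```python
import random

# Illustrative LEET table; real-world substitutions vary widely.
LEET = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}
ALPHABET = "abcdefghijklmnopqrstuvwxyz"


def leet(word: str) -> str:
    """Replace lowercase letters with look-alike digits where a mapping exists."""
    return "".join(LEET.get(ch, ch) for ch in word)


def insert_char(word: str, rng: random.Random) -> str:
    """Insert one random lowercase letter at a random position."""
    pos = rng.randrange(len(word) + 1)
    return word[:pos] + rng.choice(ALPHABET) + word[pos:]


def delete_char(word: str, rng: random.Random) -> str:
    """Delete one random character (words of length 1 are returned unchanged)."""
    if len(word) <= 1:
        return word
    pos = rng.randrange(len(word))
    return word[:pos] + word[pos + 1:]


def swap_adjacent(word: str, rng: random.Random) -> str:
    """Simulate a typo by swapping two neighbouring characters."""
    if len(word) < 2:
        return word
    pos = rng.randrange(len(word) - 1)
    chars = list(word)
    chars[pos], chars[pos + 1] = chars[pos + 1], chars[pos]
    return "".join(chars)


rng = random.Random(0)  # fixed seed so runs are repeatable
print(leet("lottery"))  # l0773ry
print(insert_char("lottery", rng))
print(delete_char("lottery", rng))
print(swap_adjacent("lottery", rng))
```

A human still reads every variant as "lottery", while an exact-match filter sees four different strings. A model trained to be resilient against such edits is meant to treat them as close to the original word.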

Google notes that RETVec may also be a choice for “on-device and web use cases”. The technology is supported natively in TensorFlow Lite and there is also a custom JavaScript implementation.

Closing Words

Gmail users benefit from the new anti-spam filter. A 38% improvement in spam detection is substantial, especially considering Gmail’s daily mail volume. Google benefits from the deployment as well, as performance improves significantly thanks to the lightweight nature of the new text vectorizer.

Now You: do you use Gmail?

Tags: gmail
Category: Security & Privacy

