Hybrid Crowd-based and Caller Recognition Robocall Reduction System

This system for reducing illegal robocalls uses a “cocktail” of both known and new technologies and methods. We first describe one key novel element, then explain the additional elements of the implementation.

This system uses white-lists and black-lists. Uniquely, however, it uses two independent black-lists. The first black-list, like the white-list, is keyed on an Enhanced Caller-ID. (Enhancements are discussed in the Solution.) The second black-list uses caller recognition based on an audio hash of the first few seconds of the caller’s voice.

Calls that match the Enhanced Caller-ID white-list are put through without further processing. The white-list draws its data from four sources:

1) the recipient’s own contact list (if available); 2) the recipient’s historical marking of the source using a “white button”; 3) approved institutional Enhanced Caller-IDs; 4) sources trusted by the recipient’s telecom provider.

Note that (3) and (4) above are novel sources for white-lists. The “approved institution” source is a way for large entities that make a large number of outgoing phone calls, such as the California DMV or hospitals, to get “pre-white-listed.” Insurance agents with a large client base, for example, may wish to receive this type of advance approval. Option (4) allows the telecoms to build their own lists of “approved” numbers, drawn from any source they “trust.”

The first step is the classification of all incoming calls into one of three categories, using the Enhanced Caller-ID:

1) white-list (known caller); 2) black-list (known spam source); 3) unknown source.

White-listed calls are passed through unaltered. Note that the vast majority of all calls are expected to be white-listed. Black-listed calls are dropped immediately, after logging the attempt. Calls from an unknown source are then processed according to the steps below.
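The classification step can be sketched as follows. This is a minimal illustration, not the system's actual implementation: the function names, the use of plain strings for Enhanced Caller-IDs, and the choice to check the white-list before the black-list are all assumptions made for the example.

```python
from enum import Enum


class CallClass(Enum):
    WHITE = "white-list"    # known caller
    BLACK = "black-list"    # known spam source
    UNKNOWN = "unknown"     # needs audio screening


def build_white_list(contacts, white_button_marks, approved_institutions, provider_trusted):
    """Union of the four white-list sources named in the text."""
    return (set(contacts) | set(white_button_marks)
            | set(approved_institutions) | set(provider_trusted))


def classify(enhanced_caller_id, white_list, black_list):
    # Assumption: the white-list takes precedence if an ID appears on both lists.
    if enhanced_caller_id in white_list:
        return CallClass.WHITE
    if enhanced_caller_id in black_list:
        return CallClass.BLACK
    return CallClass.UNKNOWN


def handle(enhanced_caller_id, white_list, black_list, log):
    """Return the action for an incoming call: connect, drop, or screen."""
    call_class = classify(enhanced_caller_id, white_list, black_list)
    if call_class is CallClass.WHITE:
        return "connect"
    if call_class is CallClass.BLACK:
        log.append(enhanced_caller_id)  # log the attempt, then drop the call
        return "drop"
    return "screen"
```

Keeping the four sources as separate inputs and merging them at lookup time would let each source be updated independently, for example when the telecom provider refreshes its trusted list.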

Calls of unknown origin are answered electronically before being put through to the recipient, for example, with a recorded “hello” in the voice of the recipient. Then, the first few seconds of the caller’s voice are recorded. This recording is then compared to a small, local database of known robocall voices. (A shortened audio “hash” is used for this purpose.) An audio black-list match indicates a robocall, and the call is logged and dropped without being put through to the recipient. A non-match causes the caller to hear a “please stand by” response. The call is then put through to the recipient, who first hears the recorded first seconds of the caller’s voice; then the two parties are connected live. The recorded information is deleted about a minute later.
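The screening decision for an unknown call might look like the sketch below. The audio-hashing step itself is omitted; the example assumes each hash is already available as a fixed-length byte string. Comparing hashes by Hamming distance with a tolerance (`max_bits`) is an assumption for illustration, since the text does not specify how a match is determined, and all names are hypothetical.

```python
def hamming_distance(a: bytes, b: bytes) -> int:
    """Number of differing bits between two equal-length hashes."""
    if len(a) != len(b):
        raise ValueError("hashes must have equal length")
    return sum(bin(x ^ y).count("1") for x, y in zip(a, b))


def matches_known_robocall(caller_hash, local_db, max_bits=8):
    # Assumption: a hash within max_bits of any stored hash counts as a match.
    return any(hamming_distance(caller_hash, known) <= max_bits for known in local_db)


def screen_unknown_call(caller_hash, local_db, log):
    """Decide what happens to a call that is on neither Caller-ID list."""
    if matches_known_robocall(caller_hash, local_db):
        log.append(caller_hash)        # log the robocall, then drop it
        return "drop"
    # Non-match: caller hears "please stand by"; recipient hears the
    # recorded greeting first, then the parties are connected live.
    return "connect_after_playback"
```

A tolerance of zero reduces this to an exact-match lookup, which would allow the local database to be a simple set.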

If the call turns out to be a robocall, but the recorded few seconds did not match the local robocall voice database, the recipient presses the “black button,” marking the source as illegal phone spam. This causes the recorded caller’s voice to be forwarded to a central database, along with other identifying information, such as the Enhanced Caller-ID (described further below). An aggregator periodically pushes qualified spam audio hashes to the distributed, local databases.
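One way to picture the “black button” reporting and the aggregator is sketched below. The text does not say what makes a spam hash “qualified”; this example assumes, purely for illustration, that a hash qualifies once a minimum number of distinct recipients have reported it. The class, method names, and the `min_reporters` threshold are hypothetical.

```python
from collections import defaultdict


class Aggregator:
    """Central collection point for black-button reports (illustrative only)."""

    def __init__(self, min_reporters=3):
        self.min_reporters = min_reporters       # assumed qualification rule
        self._reporters = defaultdict(set)       # audio hash -> recipients who reported it
        self._caller_ids = defaultdict(set)      # audio hash -> Enhanced Caller-IDs seen

    def report(self, audio_hash, enhanced_caller_id, recipient_id):
        """Record one black-button press."""
        self._reporters[audio_hash].add(recipient_id)
        self._caller_ids[audio_hash].add(enhanced_caller_id)

    def qualified_hashes(self):
        """Hashes reported by at least min_reporters distinct recipients."""
        return {h for h, who in self._reporters.items()
                if len(who) >= self.min_reporters}


def push_update(local_db: set, aggregator: Aggregator) -> None:
    """Periodic update: merge qualified hashes into a local database."""
    local_db |= aggregator.qualified_hashes()
```

Counting distinct recipients rather than raw reports keeps a single user from black-listing a voice by pressing the button repeatedly.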

If the caller has not been previously known to the recipient, but is not spam, the recipient presses the “white button,” which automatically adds the Enhanced Caller-ID information into the white-list.

The technology of converting recorded audio into a unique “audio hash” is well known. Because these hashes are short (typically, 64 bytes or less), many thousands of these are inexpensively stored in consumer equipment.
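As a rough check on the storage claim, the arithmetic can be worked out directly. The 64-byte figure comes from the text; the database sizes below are example values only.

```python
HASH_BYTES = 64  # upper bound on hash size given in the text


def storage_bytes(num_hashes, hash_bytes=HASH_BYTES):
    """Raw storage needed for num_hashes audio hashes, ignoring index overhead."""
    return num_hashes * hash_bytes


for n in (1_000, 10_000, 100_000):
    print(f"{n:>7} hashes -> {storage_bytes(n) / 1024:.1f} KiB")
# Prints:
#    1000 hashes -> 62.5 KiB
#   10000 hashes -> 625.0 KiB
#  100000 hashes -> 6250.0 KiB
```

Even one hundred thousand hashes would occupy only about 6 MiB before any indexing overhead.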

This system causes legitimate first-time callers to experience a slight delay, typically about five seconds. However, once the recipient white-lists their Enhanced Caller-ID after the first call, they will not experience any delay on future calls.

Updates

Kim Rubin started this project — Mar 21, 2014 06:20 PM EDT
