This telco's core network team ensures network reliability by monitoring 10 billion records per week as it streams in. Yet the team’s existing analytics solution couldn’t handle the volume or velocity of this data. This delayed reporting of degradations and outages, and even forced the team to manually look through these records to spot trends.
“The usability of the platform really has far exceeded our expectations. We've had folks go in and, within a few minutes, started to create dashboards of their own. That sort of tool adoption rate is kind of unheard of in this space.”
One of the biggest telecommunication carriers (Telco) offers wireless products and services, and has over 150 million subscribers.
The Telco’s core network team is tasked with ensuring network reliability of voice, data, and VOIP calls. The team monitors this data as it streams in－terabytes per week from multiple platforms. Network reliability is critical to Telco, and monitoring and analyzing this data is paramount to sustain network reliability, and to continue to provide the level of service Telco customers expect.
Quickly analyzing the extreme scale of this data proved to be a daunting task. Telco’s existing analytics solutions were unable to efficiently handle the volume and velocity of data, about 10 billion records per week. Network reliability reports took far too long to produce, meaning that a degradation or outage would take too long to identify and resolve, resulting in unhappy customers.
Telco network analysts couldn’t rely on these reports alone, which forced them to manually look through hundreds of millions rows of records for trend analysis. The sheer volume of operational data meant that the team was unable to spot network anomalies, or identify network trends that consumed unusual or unplanned network resources.
“We're in the terabytes weekly of data generation and its just not humanly possible to parse through that, either with our own scripts or our eyeballs. We just can't see what all is in there,” says a Platform Engineer at Telco
For any major corporation, adopting new technology can be a challenge. Telco tried an array of analytic solutions, each promising to handle their unique data challenges. Setup time often took months. Yet one of their biggest obstacles was user adoption.
“Finding the right tool to address data challenges is a difficult task, there’s so much on the market and it’s hard to bring them all in for demos,” says a Sr. Manager of Technology at Telco. “We had significant challenges in finding a product that gave us the performance we were looking for, especially one with the entry level learning curve that facilitated people wanting to use the tool.”
Telco initially tried OmniSci for high performance operational analytics, but quickly discovered that OmniSci met more than just Telco’s performance needs. “Getting OmniSci up and running, in a large environment like Telco, was one of the better experiences that I've had in deploying a tool,” says the Sr. Manager. “The OmniSci technical team was a joy to work with and that was key to us deploying so quickly.”
For OmniSci and Telco, that was only the start of the journey. OmniSci was soon enabling Telco to process billions of rows of data in real time, which allowed them to monitor network performance and maintain its high standards of network reliability. Never before had Telco been able to analyze such data volume with such velocity.
Telco analysts were soon spending less time researching problems, and arriving at solutions far more quickly. It also gave them time to deep-dive into their data to make more data-driven decisions, as well as to accelerate the frequency of their decision making.
What cemented OmniSci as the solution of choice for Telco was the ease of installation and use, which helped drive adoption rates throughout the organization and created a single tool across multiple data centers and business units. “The usability of the platform really has far exceeded our expectations. We've had folks go in and, within a few minutes, started to create dashboards of their own. That sort of tool adoption rate is, in my view, kind of unheard of in this space,” says the Sr. Manager.
"Now everyone from engineers and planners, and managers on up to directors is using OmniSci for the visibility it gives us into our data."
With the help of OmniSci, Telco’s network reliability reports now take only a few seconds to produce, down from over 20 minutes. This allows Telco analysts to quickly identify customer impacting issues across the network. Telco engineers now derive insights faster as well, as they spot immediate problems and long-term trends in a few minutes, down from over 45 minutes.
With the ability to analyze extreme data at scale, and with growing user adoption, Telco is discovering new ways of using OmniSci. “So far with OmniSci, we've been able to solve multiple data problems across multiple platforms, and even across different environments. Instead of just network events and logs, we've actually done it with financial data, with spreadsheets, with smaller data sets that people may not normally look into for performance related data analytics,” quips the Sr. Manager.
Telco is also leveraging the extreme speed of OmniSci to deliver WiFi services in several major U.S. stadiums. The core network team uses OmniSci to view network traffic and resolve stadium authentication and connection issues during short, two-to-three hour stadium events. “Now we have access to this stadium data in near real time, so now we analyze what's going on at the event and we can provide the type of in-stadium service that our customers expect,” the Sr. Manager adds.
OmniSci is also saving Telco money by giving the company the ability to do data forensics and to identify misbehaving devices and rogue users that hog up inordinate resources. These users and devices derail Telco’s resource calculations and allocations. “It costs Telco money because we plan for a certain level of traffic or transactions. When you find out there's these rogue devices or rogue users, you have to go back and revisit your calculations, and you have to be able to hunt these devices down,” claims the Platform Engineer.
Telco believes there’s huge opportunity in leveraging data in every aspect of its business, and this team is reaching out to business owners throughout the organization to explore those opportunities. One of those opportunities is in preventative maintenance.
“We're basically looking at a five day rolling window, currently about nine billion rows, and we’re deep-diving into that data to prevent future problems and troubleshoot issues before they impact our customers,”