continents. This diversity combines an exciting perspective with coverage on examining data analytics
in the context of today's big data.
It is worth highlighting that this article does not aim to provide a comprehensive evaluation of the current
state of big data analysis, nor to provide a future big data analysis research direction. The intention is to
present the authors' personal viewpoints and offer their perspectives on the future based on their views.
Therefore, there will be always minimal indicative argument or literary support, given the rapidly
changing landscape and significant lag of academic research coverage. Indeed, many critical issues
and relevant approaches are not explicitly covered in this article and are best left to research papers.
While all authors have contributed to the overall study, each author has focused on their specific
specialties in the following discussions. Zhou covers artificial intelligence, while Chawla brings a data
mining and data science perspective. Jin provides a view from computational intelligence and
meta-heuristic global optimization, and Williams draws on a machine learning and data mining
background applied as a practicing data scientist and consultant to industry globally.
Every year, we have been observing a significant improvement in our ability to collect knowledge from
various sensing devices, systems, in multiple formats, from independent or attached applications. This
big data has exceeded our ability to process, analyze, store, and visualize these datasets. Consider the
data on the internet. The websites indexed by Google were around 100 million in 1998, but quickly
reached one billion in 2000 and have now exceeded one trillion in 2008. In 2016, it is around 1.3 trillion.
CLOUD COMPUTNG SERVICE MODELS
Cloud computing offers a variety of release versions, including Platform as a Service (PaaS), Software
as a Service (SaaS), Infrastructure as a Service (IaaS), and Hardware as a Service (HaaS). These
services can provide benefits to businesses that they may not be able to achieve otherwise. Companies
can also use cloud deployment as a test run before implementing a new technology or application.
PaaS provides businesses with a range of options for designing and developing applications. This
includes application design and development tools, application testing, versioning, integration,
deployment, hosting, state monitoring and other relevant development tools. PaaS can help businesses
save costs through standardization and higher utilization of cloud-based computing across different
applications. Other benefits of using PaaS include reducing risks by using pre-tested technologies,
ensuring common services, improving software security, and reducing capacity requirements needed
for new systems development. When it comes to big data, PaaS offers businesses a platform for
creating and using customized applications required to analyze large amounts of unstructured data at a
low cost and low risk in a secure environment.
SaaS provides businesses with applications that are stored and operated on virtual servers in the cloud.
Companies are not charged for hardware, only for the bandwidth and number of users required. The
main advantage of SaaS is that businesses can shift the risks associated with software acquisition while
moving from being capital-intensive to operational. Benefits of using SaaS include easier software
management, automatic updates and patch management, software compatibility across the business,
easier collaboration, and global accessibility. SaaS provides businesses analyzing big data with proven
software solutions for data analysis. The difference between SaaS and PaaS in this case is that SaaS
will not provide a customized solution, whereas PaaS will allow the business to develop a solution
tailored to its needs.
In the IaaS model, a client company will pay for the use of hardware to support processing operations,
including storage, servers, networking equipment, and more. IaaS is the cloud computing model that is
receiving the most attention from the market, with an expectation of 25% of organizations planning to
adopt a provider for IaaS. Services available to businesses through the IaaS model include disaster
recovery, compute as a service, storage as a service, data center as a service, virtual desktop
infrastructure, and cloud bursting, which provides peak load capacity for variable processes. Benefits of
IaaS include increased financial flexibility, choice of services, business agility, cost-effective scalability,
and improved security.
While not yet being used as widely as PaaS, SaaS, or IaaS, HaaS is a cloud service based on the
time-sharing model used on minicomputers and mainframes in the 1960s and 1970s. Time-sharing