Category | UK education collection

Big Data Management and Processing (Chapman & Hall/CRC Big Data Series)

by Kuan-Ching Li Hai Jiang Albert Y. Zomaya

From the Foreword: "Big Data Management and Processing is [a] state-of-the-art book that deals with a wide range of topical themes in the field of Big Data. The book, which probes many issues related to this exciting and rapidly growing field, covers processing, management, analytics, and applications... [It] is a very valuable addition to the literature. It will serve as a source of up-to-date research in this continuously developing area. The book also provides an opportunity for researchers to explore the use of advanced computing technologies and their impact on enhancing our capabilities to conduct more sophisticated studies." ---Sartaj Sahni, University of Florida, USA "Big Data Management and Processing covers the latest Big Data research results in processing, analytics, management and applications. Both fundamental insights and representative applications are provided. This book is a timely and valuable resource for students, researchers and seasoned practitioners in Big Data fields. --Hai Jin, Huazhong University of Science and Technology, China Big Data Management and Processing explores a range of big data related issues and their impact on the design of new computing systems. The twenty-one chapters were carefully selected and feature contributions from several outstanding researchers. The book endeavors to strike a balance between theoretical and practical coverage of innovative problem solving techniques for a range of platforms. It serves as a repository of paradigms, technologies, and applications that target different facets of big data computing systems. The first part of the book explores energy and resource management issues, as well as legal compliance and quality management for Big Data. It covers In-Memory computing and In-Memory data grids, as well as co-scheduling for high performance computing applications. The second part of the book includes comprehensive coverage of Hadoop and Spark, along with security, privacy, and trust challenges and solutions. The latter part of the book covers mining and clustering in Big Data, and includes applications in genomics, hospital big data processing, and vehicular cloud computing. The book also analyzes funding for Big Data projects.

Copyright: 2017

ISBN: 9781498768085

Features:

Big data management in Sensing: Applications in AI and IoT

by Renny Fernandez Terrance Frederick Fernandez

The book is centrally focused on human computer Interaction and how sensors within small and wide groups of Nano-robots employ Deep Learning for applications in industry. It covers a wide array of topics that are useful for researchers and students to gain knowledge about AI and sensors in nanobots. Furthermore, the book explores Deep Learning approaches to enhance the accuracy of AI systems applied in medical robotics for surgical techniques. Secondly, we plan to explore bio-nano-robotics, which is a field in nano-robotics, that deals with automatic intelligence handling, self-assembly and replication, information processing and programmability.

Copyright: 2021

ISBN: 9781000797435

Features: Contains images

Big data management in Sensing: Applications in AI and IoT

The book is centrally focused on human computer Interaction and how sensors within small and wide groups of Nano-robots employ Deep Learning for applications in industry. It covers a wide array of topics that are useful for researchers and students to gain knowledge about AI and sensors in nanobots. Furthermore, the book explores Deep Learning approaches to enhance the accuracy of AI systems applied in medical robotics for surgical techniques. Secondly, we plan to explore bio-nano-robotics, which is a field in nano-robotics, that deals with automatic intelligence handling, self-assembly and replication, information processing and programmability.

Copyright: 2021

ISBN: 9781000794274

Features:

Big Data MBA: Driving Business Strategies with Data Science

by Bill Schmarzo

Integrate big data into business to drive competitive advantage and sustainable success Big Data MBA brings insight and expertise to leveraging big data in business so you can harness the power of analytics and gain a true business advantage. Based on a practical framework with supporting methodology and hands-on exercises, this book helps identify where and how big data can help you transform your business. You'll learn how to exploit new sources of customer, product, and operational data, coupled with advanced analytics and data science, to optimize key processes, uncover monetization opportunities, and create new sources of competitive differentiation. The discussion includes guidelines for operationalizing analytics, optimal organizational structure, and using analytic insights throughout your organization's user experience to customers and front-end employees alike. You'll learn to “think like a data scientist” as you build upon the decisions your business is trying to make, the hypotheses you need to test, and the predictions you need to produce. Business stakeholders no longer need to relinquish control of data and analytics to IT. In fact, they must champion the organization's data collection and analysis efforts. This book is a primer on the business approach to analytics, providing the practical understanding you need to convert data into opportunity. Understand where and how to leverage big data Integrate analytics into everyday operations Structure your organization to drive analytic insights Optimize processes, uncover opportunities, and stand out from the rest Help business stakeholders to “think like a data scientist” Understand appropriate business application of different analytic techniques If you want data to transform your business, you need to know how to put it to use. Big Data MBA shows you how to implement big data and analytics to make better decisions.

Copyright: 2016

ISBN: 9781119181385

Features: Contains images

Big Data MBA: Driving Business Strategies with Data Science

by Bill Schmarzo

Integrate big data into business to drive competitive advantage and sustainable success Big Data MBA brings insight and expertise to leveraging big data in business so you can harness the power of analytics and gain a true business advantage. Based on a practical framework with supporting methodology and hands-on exercises, this book helps identify where and how big data can help you transform your business. You'll learn how to exploit new sources of customer, product, and operational data, coupled with advanced analytics and data science, to optimize key processes, uncover monetization opportunities, and create new sources of competitive differentiation. The discussion includes guidelines for operationalizing analytics, optimal organizational structure, and using analytic insights throughout your organization's user experience to customers and front-end employees alike. You'll learn to “think like a data scientist” as you build upon the decisions your business is trying to make, the hypotheses you need to test, and the predictions you need to produce. Business stakeholders no longer need to relinquish control of data and analytics to IT. In fact, they must champion the organization's data collection and analysis efforts. This book is a primer on the business approach to analytics, providing the practical understanding you need to convert data into opportunity. Understand where and how to leverage big data Integrate analytics into everyday operations Structure your organization to drive analytic insights Optimize processes, uncover opportunities, and stand out from the rest Help business stakeholders to “think like a data scientist” Understand appropriate business application of different analytic techniques If you want data to transform your business, you need to know how to put it to use. Big Data MBA shows you how to implement big data and analytics to make better decisions.

Copyright: 2016

ISBN: 9781119238843

Features:

Big Data, Mining, and Analytics: Components of Strategic Decision Making

by Stephan Kudyba

There is an ongoing data explosion transpiring that will make previous creations, collections, and storage of data look trivial. Big Data, Mining, and Analytics: Components of Strategic Decision Making ties together big data, data mining, and analytics to explain how readers can leverage them to extract valuable insights from their data. Facilitati

Copyright: 2014

ISBN: 9781466568716

Features:

Big Data of Complex Networks (Chapman & Hall/CRC Big Data Series)

by Matthias Dehmer Frank Emmert-Streib Stefan Pickl Andreas Holzinger

Big Data of Complex Networks presents and explains the methods from the study of big data that can be used in analysing massive structural data sets, including both very large networks and sets of graphs. As well as applying statistical analysis techniques like sampling and bootstrapping in an interdisciplinary manner to produce novel techniques for analyzing massive amounts of data, this book also explores the possibilities offered by the special aspects such as computer memory in investigating large sets of complex networks. Intended for computer scientists, statisticians and mathematicians interested in the big data and networks, Big Data of Complex Networks is also a valuable tool for researchers in the fields of visualization, data analysis, computer vision and bioinformatics. Key features: Provides a complete discussion of both the hardware and software used to organize big data Describes a wide range of useful applications for managing big data and resultant data sets Maintains a firm focus on massive data and large networks Unveils innovative techniques to help readers handle big data Matthias Dehmer received his PhD in computer science from the Darmstadt University of Technology, Germany. Currently, he is Professor at UMIT – The Health and Life Sciences University, Austria, and the Universität der Bundeswehr München. His research interests are in graph theory, data science, complex networks, complexity, statistics and information theory. Frank Emmert-Streib received his PhD in theoretical physics from the University of Bremen, and is currently Associate professor at Tampere University of Technology, Finland. His research interests are in the field of computational biology, machine learning and network medicine. Stefan Pickl holds a PhD in mathematics from the Darmstadt University of Technology, and is currently a Professor at Bundeswehr Universität München. His research interests are in operations research, systems biology, graph theory and discrete optimization. Andreas Holzinger received his PhD in cognitive science from Graz University and his habilitation (second PhD) in computer science from Graz University of Technology. He is head of the Holzinger Group HCI-KDD at the Medical University Graz and Visiting Professor for Machine Learning in Health Informatics Vienna University of Technology.

Copyright: 2017

ISBN: 9781498723626

Features:

Big Data of Complex Networks (Chapman & Hall/CRC Big Data Series)

by Matthias Dehmer, Frank Emmert-Streib, Stefan Pickl and Andreas Holzinger

Big Data of Complex Networks presents and explains the methods from the study of big data that can be used in analysing massive structural data sets, including both very large networks and sets of graphs. As well as applying statistical analysis techniques like sampling and bootstrapping in an interdisciplinary manner to produce novel techniques for analyzing massive amounts of data, this book also explores the possibilities offered by the special aspects such as computer memory in investigating large sets of complex networks. Intended for computer scientists, statisticians and mathematicians interested in the big data and networks, Big Data of Complex Networks is also a valuable tool for researchers in the fields of visualization, data analysis, computer vision and bioinformatics. Key features: Provides a complete discussion of both the hardware and software used to organize big data Describes a wide range of useful applications for managing big data and resultant data sets Maintains a firm focus on massive data and large networks Unveils innovative techniques to help readers handle big data Matthias Dehmer received his PhD in computer science from the Darmstadt University of Technology, Germany. Currently, he is Professor at UMIT – The Health and Life Sciences University, Austria, and the Universität der Bundeswehr München. His research interests are in graph theory, data science, complex networks, complexity, statistics and information theory. Frank Emmert-Streib received his PhD in theoretical physics from the University of Bremen, and is currently Associate professor at Tampere University of Technology, Finland. His research interests are in the field of computational biology, machine learning and network medicine. Stefan Pickl holds a PhD in mathematics from the Darmstadt University of Technology, and is currently a Professor at Bundeswehr Universität München. His research interests are in operations research, systems biology, graph theory and discrete optimization. Andreas Holzinger received his PhD in cognitive science from Graz University and his habilitation (second PhD) in computer science from Graz University of Technology. He is head of the Holzinger Group HCI-KDD at the Medical University Graz and Visiting Professor for Machine Learning in Health Informatics Vienna University of Technology.

Copyright: 2017

ISBN: 9781315353593

Features: Contains images

Big Data on Campus: Data Analytics and Decision Making in Higher Education

by Karen L. Webber and Henry Y. Zheng

The continuing importance of data analytics is not lost on higher education leaders, who face a multitude of challenges, including increasing operating costs, dwindling state support, limits to tuition increases, and increased competition from the for-profit sector. To navigate these challenges, savvy leaders must leverage data to make sound decisions. In Big Data on Campus, leading data analytics experts and higher ed leaders show the role that analytics can play in the better administration of colleges and universities. Aimed at senior administrative leaders, practitioners of institutional research, technology professionals, and graduate students in higher education, the book opens with a conceptual discussion of the roles that data analytics can play in higher education administration. Subsequent chapters address recent developments in technology, the rapid accumulation of data assets, organizational maturity in building analytical capabilities, and methodological advancements in developing predictive and prescriptive analytics. Each chapter includes a literature review of the research and application of analytics developments in their respective functional areas, a discussion of industry trends, examples of the application of data analytics in their decision process, and other related issues that readers may wish to consider in their own organizational environment to find opportunities for building robust data analytics capabilities.Using a series of focused discussions and case studies, Big Data on Campus helps readers understand how analytics can support major organizational functions in higher education, including admission decisions, retention and enrollment management, student life and engagement, academic and career advising, student learning and assessment, and academic program planning. The final section of the book addresses major issues and human factors involved in using analytics to support decision making; the ethical, cultural, and managerial implications of its use; the role of university leaders in promoting analytics in decision making; and the need for a strong campus community to embrace the analytics revolution. Contributors: Rana Glasgal, J. Michael Gower, Tom Gutman, Brian P. Hinote, Braden J. Hosch, Aditya Johri, Christine M. Keller, Carrie Klein, Jaime Lester, Carrie Hancock Marcinkevage, Gail B. Marsh, Susan M. Menditto, Jillian N. Morn, Valentina Nestor, Cathy O'Bryan, Huzefa Rangwala, Timothy Renick, Charles Tegen, Rachit Thariani, Chris Tompkins, Lindsay K. Wayt, Karen L. Webber, Henry Y. Zheng, Ying Zhou

Copyright: 2020

ISBN: 9781421439044

Features: Contains images

Big Data on Campus: Data Analytics and Decision Making in Higher Education

by Karen L. Webber Henry Y. Zheng

The continuing importance of data analytics is not lost on higher education leaders, who face a multitude of challenges, including increasing operating costs, dwindling state support, limits to tuition increases, and increased competition from the for-profit sector. To navigate these challenges, savvy leaders must leverage data to make sound decisions. In Big Data on Campus, leading data analytics experts and higher ed leaders show the role that analytics can play in the better administration of colleges and universities. Aimed at senior administrative leaders, practitioners of institutional research, technology professionals, and graduate students in higher education, the book opens with a conceptual discussion of the roles that data analytics can play in higher education administration. Subsequent chapters address recent developments in technology, the rapid accumulation of data assets, organizational maturity in building analytical capabilities, and methodological advancements in developing predictive and prescriptive analytics. Each chapter includes a literature review of the research and application of analytics developments in their respective functional areas, a discussion of industry trends, examples of the application of data analytics in their decision process, and other related issues that readers may wish to consider in their own organizational environment to find opportunities for building robust data analytics capabilities.Using a series of focused discussions and case studies, Big Data on Campus helps readers understand how analytics can support major organizational functions in higher education, including admission decisions, retention and enrollment management, student life and engagement, academic and career advising, student learning and assessment, and academic program planning. The final section of the book addresses major issues and human factors involved in using analytics to support decision making; the ethical, cultural, and managerial implications of its use; the role of university leaders in promoting analytics in decision making; and the need for a strong campus community to embrace the analytics revolution. Contributors: Rana Glasgal, J. Michael Gower, Tom Gutman, Brian P. Hinote, Braden J. Hosch, Aditya Johri, Christine M. Keller, Carrie Klein, Jaime Lester, Carrie Hancock Marcinkevage, Gail B. Marsh, Susan M. Menditto, Jillian N. Morn, Valentina Nestor, Cathy O'Bryan, Huzefa Rangwala, Timothy Renick, Charles Tegen, Rachit Thariani, Chris Tompkins, Lindsay K. Wayt, Karen L. Webber, Henry Y. Zheng, Ying Zhou

Copyright: 2020

ISBN: 9781421439044

Features:

Big Data Optimization: Recent Developments and Challenges (Studies in Big Data #18)

by Ali Emrouznejad

The main objective of this book is to provide the necessary background to work with big data by introducing some novel optimization algorithms and codes capable of working in the big data setting as well as introducing some applications in big data optimization for both academics and practitioners interested, and to benefit society, industry, academia, and government. Presenting applications in a variety of industries, this book will be useful for the researchers aiming to analyses large scale data. Several optimization algorithms for big data including convergent parallel algorithms, limited memory bundle algorithm, diagonal bundle method, convergent parallel algorithms, network analytics, and many more have been explored in this book.

Copyright: 2016

ISBN: 9783319302652

Features: Contains images

Big Data Platforms and Applications: Case Studies, Methods, Techniques, and Performance Evaluation (Computer Communications and Networks)

by Florin Pop Gabriel Neagu

This book provides a review of advanced topics relating to the theory, research, analysis and implementation in the context of big data platforms and their applications, with a focus on methods, techniques, and performance evaluation. The explosive growth in the volume, speed, and variety of data being produced every day requires a continuous increase in the processing speeds of servers and of entire network infrastructures, as well as new resource management models. This poses significant challenges (and provides striking development opportunities) for data intensive and high-performance computing, i.e., how to efficiently turn extremely large datasets into valuable information and meaningful knowledge.The task of context data management is further complicated by the variety of sources such data derives from, resulting in different data formats, with varying storage, transformation, delivery, and archiving requirements. At the same time rapid responses are needed for real-time applications. With the emergence of cloud infrastructures, achieving highly scalable data management in such contexts is a critical problem, as the overall application performance is highly dependent on the properties of the data management service.

Copyright: 2021

ISBN: 9783030388362

Features: Contains images

Big Data, Political Campaigning and the Law: Democracy and Privacy in the Age of Micro-Targeting

by Janice Richardson Moira Paterson Normann Witzleb

In this multidisciplinary book, experts from around the globe examine how data-driven political campaigning works, what challenges it poses for personal privacy and democracy, and how emerging practices should be regulated. The rise of big data analytics in the political process has triggered official investigations in many countries around the world, and become the subject of broad and intense debate. Political parties increasingly rely on data analytics to profile the electorate and to target specific voter groups with individualised messages based on their demographic attributes. Political micro-targeting has become a major factor in modern campaigning, because of its potential to influence opinions, to mobilise supporters and to get out votes. The book explores the legal, philosophical and political dimensions of big data analytics in the electoral process. It demonstrates that the unregulated use of big personal data for political purposes not only infringes voters’ privacy rights, but also has the potential to jeopardise the future of the democratic process, and proposes reforms to address the key regulatory and ethical questions arising from the mining, use and storage of massive amounts of voter data. Providing an interdisciplinary assessment of the use and regulation of big data in the political process, this book will appeal to scholars from law, political science, political philosophy and media studies, policy makers and anyone who cares about democracy in the age of data-driven political campaigning.

Copyright: 2020

ISBN: 9781000741018

Features:

Big Data, Political Campaigning and the Law: Democracy and Privacy in the Age of Micro-Targeting

by Janice Richardson Moira Paterson Normann Witzleb

In this multidisciplinary book, experts from around the globe examine how data-driven political campaigning works, what challenges it poses for personal privacy and democracy, and how emerging practices should be regulated. The rise of big data analytics in the political process has triggered official investigations in many countries around the world, and become the subject of broad and intense debate. Political parties increasingly rely on data analytics to profile the electorate and to target specific voter groups with individualised messages based on their demographic attributes. Political micro-targeting has become a major factor in modern campaigning, because of its potential to influence opinions, to mobilise supporters and to get out votes. The book explores the legal, philosophical and political dimensions of big data analytics in the electoral process. It demonstrates that the unregulated use of big personal data for political purposes not only infringes voters’ privacy rights, but also has the potential to jeopardise the future of the democratic process, and proposes reforms to address the key regulatory and ethical questions arising from the mining, use and storage of massive amounts of voter data. Providing an interdisciplinary assessment of the use and regulation of big data in the political process, this book will appeal to scholars from law, political science, political philosophy and media studies, policy makers and anyone who cares about democracy in the age of data-driven political campaigning.

Copyright: 2020

ISBN: 9781000747393

Features:

Big Data Preprocessing: Enabling Smart Data

by Julián Luengo Diego García-Gil Sergio Ramírez-Gallego Salvador García Francisco Herrera

This book offers a comprehensible overview of Big Data Preprocessing, which includes a formal description of each problem. It also focuses on the most relevant proposed solutions. This book illustrates actual implementations of algorithms that helps the reader deal with these problems. This book stresses the gap that exists between big, raw data and the requirements of quality data that businesses are demanding. This is called Smart Data, and to achieve Smart Data the preprocessing is a key step, where the imperfections, integration tasks and other processes are carried out to eliminate superfluous information. The authors present the concept of Smart Data through data preprocessing in Big Data scenarios and connect it with the emerging paradigms of IoT and edge computing, where the end points generate Smart Data without completely relying on the cloud.Finally, this book provides some novel areas of study that are gathering a deeper attention on the Big Data preprocessing. Specifically, it considers the relation with Deep Learning (as of a technique that also relies in large volumes of data), the difficulty of finding the appropriate selection and concatenation of preprocessing techniques applied and some other open problems.Practitioners and data scientists who work in this field, and want to introduce themselves to preprocessing in large data volume scenarios will want to purchase this book. Researchers that work in this field, who want to know which algorithms are currently implemented to help their investigations, may also be interested in this book.

Copyright: 2020

ISBN: 9783030391058

Features: Contains images

Big Data Privacy and Security in Smart Cities (Advanced Sciences and Technologies for Security Applications)

by Richard Jiang Ahmed Bouridane Chang-Tsun Li Danny Crookes Said Boussakta Feng Hao Eran A. Edirisinghe

This book highlights recent advances in smart cities technologies, with a focus on new technologies such as biometrics, blockchains, data encryption, data mining, machine learning, deep learning, cloud security, and mobile security. During the past five years, digital cities have been emerging as a technology reality that will come to dominate the usual life of people, in either developed or developing countries. Particularly, with big data issues from smart cities, privacy and security have been a widely concerned matter due to its relevance and sensitivity extensively present in cybersecurity, healthcare, medical service, e-commercial, e-governance, mobile banking, e-finance, digital twins, and so on. These new topics rises up with the era of smart cities and mostly associate with public sectors, which are vital to the modern life of people. This volume summarizes the recent advances in addressing the challenges on big data privacy and security in smart cities and points out the future research direction around this new challenging topic.

Copyright: 2022

ISBN: 9783031044243

Features: Contains images

Big Data Privacy Preservation for Cyber-Physical Systems (SpringerBriefs in Electrical and Computer Engineering)

by Miao Pan Jingyi Wang Sai Mounika Errapotu Xinyue Zhang Jiahao Ding Zhu Han

This SpringerBrief mainly focuses on effective big data analytics for CPS, and addresses the privacy issues that arise on various CPS applications. The authors develop a series of privacy preserving data analytic and processing methodologies through data driven optimization based on applied cryptographic techniques and differential privacy in this brief. This brief also focuses on effectively integrating the data analysis and data privacy preservation techniques to provide the most desirable solutions for the state-of-the-art CPS with various application-specific requirements. Cyber-physical systems (CPS) are the “next generation of engineered systems,” that integrate computation and networking capabilities to monitor and control entities in the physical world. Multiple domains of CPS typically collect huge amounts of data and rely on it for decision making, where the data may include individual or sensitive information, for e.g., smart metering, intelligent transportation, healthcare, sensor/data aggregation, crowd sensing etc. This brief assists users working in these areas and contributes to the literature by addressing data privacy concerns during collection, computation or big data analysis in these large scale systems. Data breaches result in undesirable loss of privacy for the participants and for the entire system, therefore identifying the vulnerabilities and developing tools to mitigate such concerns is crucial to build high confidence CPS.This Springerbrief targets professors, professionals and research scientists working in Wireless Communications, Networking, Cyber-Physical Systems and Data Science. Undergraduate and graduate-level students interested in Privacy Preservation of state-of-the-art Wireless Networks and Cyber-Physical Systems will use this Springerbrief as a study guide.

Copyright: 2019

ISBN: 9783030133702

Features: Contains images

Big Data Processing Using Spark in Cloud (Studies in Big Data #43)

by Mamta Mittal Valentina E. Balas Lalit Mohan Goyal Raghvendra Kumar

The book describes the emergence of big data technologies and the role of Spark in the entire big data stack. It compares Spark and Hadoop and identifies the shortcomings of Hadoop that have been overcome by Spark. The book mainly focuses on the in-depth architecture of Spark and our understanding of Spark RDDs and how RDD complements big data’s immutable nature, and solves it with lazy evaluation, cacheable and type inference. It also addresses advanced topics in Spark, starting with the basics of Scala and the core Spark framework, and exploring Spark data frames, machine learning using Mllib, graph analytics using Graph X and real-time processing with Apache Kafka, AWS Kenisis, and Azure Event Hub. It then goes on to investigate Spark using PySpark and R. Focusing on the current big data stack, the book examines the interaction with current big data tools, with Spark being the core processing layer for all types of data. The book is intended for data engineers and scientists working on massive datasets and big data technologies in the cloud. In addition to industry professionals, it is helpful for aspiring data processing professionals and students working in big data processing and cloud computing environments.

ISBN: 9789811305504

Features: Contains images

Big Data Processing with Apache Spark: Efficiently tackle large datasets and big data analysis with Spark and Python

by Manuel Ignacio Franco Galeano

No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark Book Description Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by learning data processing fundamentals using Resilient Distributed Datasets (RDDs), SQL, Datasets, and Dataframes APIs. After grasping these fundamentals, you'll move on to using Spark Streaming APIs to consume data in real time from TCP sockets, and integrate Amazon Web Services (AWS) for stream consumption. By the end of this book, you'll not only have understood how to use machine learning extensions and structured streams but you'll also be able to apply Spark in your own upcoming big data projects. What you will learn Write your own Python programs that can interact with Spark Implement data stream consumption using Apache Spark Recognize common operations in Spark to process known data streams Integrate Spark streaming with Amazon Web Services (AWS) Create a collaborative filtering model with the movielens dataset Apply processed data streams to Spark machine learning APIs Who this book is for Data Processing with Apache Spark is for you if you are a software engineer, architect, or IT professional who wants to explore distributed systems and big data analytics. Although you don't need any knowledge of Spark, prior experience of working with Python is recommended.

ISBN: 9781789804522

Features: Contains images

Big Data Processing with Apache Spark: Efficiently tackle large datasets and big data analysis with Spark and Python

by Manuel Ignacio Franco Galeano

No need to spend hours ploughing through endless data – let Spark, one of the fastest big data processing engines available, do the hard work for you. Key Features Get up and running with Apache Spark and Python Integrate Spark with AWS for real-time analytics Apply processed data streams to machine learning APIs of Apache Spark Book Description Processing big data in real time is challenging due to scalability, information consistency, and fault-tolerance. This book teaches you how to use Spark to make your overall analytical workflow faster and more efficient. You'll explore all core concepts and tools within the Spark ecosystem, such as Spark Streaming, the Spark Streaming API, machine learning extension, and structured streaming. You'll begin by learning data processing fundamentals using Resilient Distributed Datasets (RDDs), SQL, Datasets, and Dataframes APIs. After grasping these fundamentals, you'll move on to using Spark Streaming APIs to consume data in real time from TCP sockets, and integrate Amazon Web Services (AWS) for stream consumption. By the end of this book, you'll not only have understood how to use machine learning extensions and structured streams but you'll also be able to apply Spark in your own upcoming big data projects. What you will learn Write your own Python programs that can interact with Spark Implement data stream consumption using Apache Spark Recognize common operations in Spark to process known data streams Integrate Spark streaming with Amazon Web Services (AWS) Create a collaborative filtering model with the movielens dataset Apply processed data streams to Spark machine learning APIs Who this book is for Data Processing with Apache Spark is for you if you are a software engineer, architect, or IT professional who wants to explore distributed systems and big data analytics. Although you don't need any knowledge of Spark, prior experience of working with Python is recommended.

ISBN: 9781789804522

Features:

Big Data Revolution: What farmers, doctors and insurance agents teach us about discovering big data patterns

by Rob Thomas Patrick McSharry

Exploit the power and potential of Big Data to revolutionize business outcomes Big Data Revolution is a guide to improving performance, making better decisions, and transforming business through the effective use of Big Data. In this collaborative work by an IBM Vice President of Big Data Products and an Oxford Research Fellow, this book presents inside stories that demonstrate the power and potential of Big Data within the business realm. Readers are guided through tried-and-true methodologies for getting more out of data, and using it to the utmost advantage. This book describes the major trends emerging in the field, the pitfalls and triumphs being experienced, and the many considerations surrounding Big Data, all while guiding readers toward better decision making from the perspective of a data scientist. Companies are generating data faster than ever before, and managing that data has become a major challenge. With the right strategy, Big Data can be a powerful tool for creating effective business solutions – but deep understanding is key when applying it to individual business needs. Big Data Revolution provides the insight executives need to incorporate Big Data into a better business strategy, improving outcomes with innovation and efficient use of technology. Examine the major emerging patterns in Big Data Consider the debate surrounding the ethical use of data Recognize patterns and improve personal and organizational performance Make more informed decisions with quantifiable results In an information society, it is becoming increasingly important to make sense of data in an economically viable way. It can drive new revenue streams and give companies a competitive advantage, providing a way forward for businesses navigating an increasingly complex marketplace. Big Data Revolution provides expert insight on the tool that can revolutionize industries.

ISBN: 9781118943724

Features: Contains images

Big Data Revolution: What farmers, doctors and insurance agents teach us about discovering big data patterns

by Rob Thomas Patrick McSharry

Exploit the power and potential of Big Data to revolutionize business outcomes Big Data Revolution is a guide to improving performance, making better decisions, and transforming business through the effective use of Big Data. In this collaborative work by an IBM Vice President of Big Data Products and an Oxford Research Fellow, this book presents inside stories that demonstrate the power and potential of Big Data within the business realm. Readers are guided through tried-and-true methodologies for getting more out of data, and using it to the utmost advantage. This book describes the major trends emerging in the field, the pitfalls and triumphs being experienced, and the many considerations surrounding Big Data, all while guiding readers toward better decision making from the perspective of a data scientist. Companies are generating data faster than ever before, and managing that data has become a major challenge. With the right strategy, Big Data can be a powerful tool for creating effective business solutions – but deep understanding is key when applying it to individual business needs. Big Data Revolution provides the insight executives need to incorporate Big Data into a better business strategy, improving outcomes with innovation and efficient use of technology. Examine the major emerging patterns in Big Data Consider the debate surrounding the ethical use of data Recognize patterns and improve personal and organizational performance Make more informed decisions with quantifiable results In an information society, it is becoming increasingly important to make sense of data in an economically viable way. It can drive new revenue streams and give companies a competitive advantage, providing a way forward for businesses navigating an increasingly complex marketplace. Big Data Revolution provides expert insight on the tool that can revolutionize industries.

ISBN: 9781118943731

Features:

Big Data Science and Analytics for Smart Sustainable Urbanism: Unprecedented Paradigmatic Shifts and Practical Advancements (Advances in Science, Technology & Innovation)

by Simon Elias Bibri

We are living at the dawn of what has been termed ‘the fourth paradigm of science,’ a scientific revolution that is marked by both the emergence of big data science and analytics, and by the increasing adoption of the underlying technologies in scientific and scholarly research practices. Everything about science development or knowledge production is fundamentally changing thanks to the ever-increasing deluge of data. This is the primary fuel of the new age, which powerful computational processes or analytics algorithms are using to generate valuable knowledge for enhanced decision-making, and deep insights pertaining to a wide variety of practical uses and applications. This book addresses the complex interplay of the scientific, technological, and social dimensions of the city, and what it entails in terms of the systemic implications for smart sustainable urbanism. In concrete terms, it explores the interdisciplinary and transdisciplinary field of smart sustainable urbanism and the unprecedented paradigmatic shifts and practical advances it is undergoing in light of big data science and analytics. This new era of science and technology embodies an unprecedentedly transformative and constitutive power—manifested not only in the form of revolutionizing science and transforming knowledge, but also in advancing social practices, producing new discourses, catalyzing major shifts, and fostering societal transitions. Of particular relevance, it is instigating a massive change in the way both smart cities and sustainable cities are studied and understood, and in how they are planned, designed, operated, managed, and governed in the face of urbanization. This relates to what has been dubbed data-driven smart sustainable urbanism, an emerging approach based on a computational understanding of city systems and processes that reduces urban life to logical and algorithmic rules and procedures, while also harnessing urban big data to provide a more holistic and integrated view or synoptic intelligence of the city. This is increasingly being directed towards improving, advancing, and maintaining the contribution of both sustainable cities and smart cities to the goals of sustainable development. This timely and multifaceted book is aimed at a broad readership. As such, it will appeal to urban scientists, data scientists, urbanists, planners, engineers, designers, policymakers, philosophers of science, and futurists, as well as all readers interested in an overview of the pivotal role of big data science and analytics in advancing every academic discipline and social practice concerned with data–intensive science and its application, particularly in relation to sustainability.

ISBN: 9783030173128

Features: Contains images

Big Data Science in Finance

by Irene Aldridge M. Avellaneda

Explains the mathematics, theory, and methods of Big Data as applied to finance and investing Data science has fundamentally changed Wall Street—applied mathematics and software code are increasingly driving finance and investment-decision tools. Big Data Science in Finance examines the mathematics, theory, and practical use of the revolutionary techniques that are transforming the industry. Designed for mathematically-advanced students and discerning financial practitioners alike, this energizing book presents new, cutting-edge content based on world-class research taught in the leading Financial Mathematics and Engineering programs in the world. Marco Avellaneda, a leader in quantitative finance, and quantitative methodology author Irene Aldridge help readers harness the power of Big Data. Comprehensive in scope, this book offers in-depth instruction on how to separate signal from noise, how to deal with missing data values, and how to utilize Big Data techniques in decision-making. Key topics include data clustering, data storage optimization, Big Data dynamics, Monte Carlo methods and their applications in Big Data analysis, and more. This valuable book: Provides a complete account of Big Data that includes proofs, step-by-step applications, and code samples Explains the difference between Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) Covers vital topics in the field in a clear, straightforward manner Compares, contrasts, and discusses Big Data and Small Data Includes Cornell University-tested educational materials such as lesson plans, end-of-chapter questions, and downloadable lecture slides Big Data Science in Finance: Mathematics and Applications is an important, up-to-date resource for students in economics, econometrics, finance, applied mathematics, industrial engineering, and business courses, and for investment managers, quantitative traders, risk and portfolio managers, and other financial practitioners.

ISBN: 9781119602996

Features:

Big Data Science in Finance

by Irene Aldridge Marco Avellaneda

Explains the mathematics, theory, and methods of Big Data as applied to finance and investing Data science has fundamentally changed Wall Street—applied mathematics and software code are increasingly driving finance and investment-decision tools. Big Data Science in Finance examines the mathematics, theory, and practical use of the revolutionary techniques that are transforming the industry. Designed for mathematically-advanced students and discerning financial practitioners alike, this energizing book presents new, cutting-edge content based on world-class research taught in the leading Financial Mathematics and Engineering programs in the world. Marco Avellaneda, a leader in quantitative finance, and quantitative methodology author Irene Aldridge help readers harness the power of Big Data. Comprehensive in scope, this book offers in-depth instruction on how to separate signal from noise, how to deal with missing data values, and how to utilize Big Data techniques in decision-making. Key topics include data clustering, data storage optimization, Big Data dynamics, Monte Carlo methods and their applications in Big Data analysis, and more. This valuable book: Provides a complete account of Big Data that includes proofs, step-by-step applications, and code samples Explains the difference between Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) Covers vital topics in the field in a clear, straightforward manner Compares, contrasts, and discusses Big Data and Small Data Includes Cornell University-tested educational materials such as lesson plans, end-of-chapter questions, and downloadable lecture slides Big Data Science in Finance: Mathematics and Applications is an important, up-to-date resource for students in economics, econometrics, finance, applied mathematics, industrial engineering, and business courses, and for investment managers, quantitative traders, risk and portfolio managers, and other financial practitioners.

ISBN: 9781119602972

Features: Contains images

Refine Search