Using Open Source Platforms for Business Intelligence (The Morgan Kaufmann Series on Business Intelligence)
by Lyndsay Wise
Open Source BI solutions have many advantages over traditional proprietary software, from offering lower initial costs to more flexible support and integration options; but, until now, there has been no comprehensive guide to the complete offerings of the OS BI market. Writing for IT managers and business analysts without bias toward any BI suite, industry insider Lyndsay Wise covers the benefits and challenges of all available open source BI systems and tools, enabling readers to identify the s...
Azure Data Factory Cookbook
by Dmitry Anoshin, Dmitry Foshin, Roman Storchak, and Xenia Ireton
Solve real-world data problems and create data-driven workflows for easy data movement and processing at scale with Azure Data FactoryKey FeaturesLearn how to load and transform data from various sources, both on-premises and on cloudUse Azure Data Factory’s visual environment to build and manage hybrid ETL pipelinesDiscover how to prepare, transform, process, and enrich data to generate key insightsBook DescriptionAzure Data Factory (ADF) is a modern data integration tool available on Microsoft...
Feature Engineering for Machine Learning
by Alice Zheng and Amanda Casari
Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you'll learn techniques for extracting and transforming features-the numeric representations of raw data-into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering. Rather than simply teach th...
Publisher's Note: Products purchased from Third Party sellers are not guaranteed by the publisher for quality, authenticity, or access to any online entitlements included with the product.Deploy and Maintain an Integrated MDS ArchitectureHarness your master data and grow revenue while reducing administrative costs. Thoroughly revised to cover the latest MDS features, Microsoft SQL Server 2012Master Data Services, Second Edition shows how to implement and manage a centralized, customer-focused MD...
Teradata for Executives (Tera-Tom Genius)
by Tom Coffing and Leslie Nolander
Tietovarastointi Ja Tietohallinnon Strateginen Suunnittelu
by A Tormanen
Impala in Action:Querying and mining big data
by Richard L Saltzer and Istvan Szegedi
DESCRIPTION Hadoop queries in Pig or Hive can be too slow for real-time data analysis. Impala, an ultra-speedy query engine from Cloudera, supercharges Hadoop by avoiding the typical Map-Reduce overhead and parallelizing queries so that they can run on multiple nodes. This is a big deal for big data, because with Impala, querying Hadoop takes seconds rather than minutes. Impala's dialect is close to standard SQL, and Impala seamlessly accesses HBase and HDFS (Hadoop Distributed File Sys...
* This is the first book to provide in--depth coverage of star schema aggregates used in dimensional modeling--from selection and design, to loading and usage, to specific tasks and deliverables for implementation projects* Covers the principles of aggregate schema design and the pros and cons of various types of commercial solutions for navigating and building aggregates* Discusses how to include aggregates in data warehouse development projects that focus on incremental development, iterative...
Without a data strategy, the people within an organization have no guidelines for making decisions that are absolutely crucial to the success of the IT organization and to the entire organization. The absence of a strategy gives a blank check to those who want to pursue their own agendas, including those who want to try new database management systems, new technologies (often unproven), and new tools. This type of environment provides no hope for success. Data Strategy should result in th...
A Methodology for Building the Data Lakehouse
by Bill Inmon, Ranjeet Srivastava, and Patty Haines
ERP & Data Warehousing in Organizations: Issues and Challenges
Harness the power of Microsoft Fabric to develop data analytics solutions for various use cases guided by step-by-step instructions Key Features Explore Microsoft Fabric and its features through real-world examples Build data analytics solutions for lakehouses, data warehouses, real-time analytics, and data science Monitor, manage, and administer your Fabric platform and analytics system to ensure flexibility, performance, security, and control Purchase of the print or Kindle book includes a f...
SQL Server Interview Questions You'll Most Likely Be Asked (Job Interview Questions, #2)
by Vibrant Publishers
Oracle Pl/SQL Interview Questions You'll Most Likely Be Asked (Job Interview Questions, #12)
A practical guide to making good decisions in a world of missing dataIn the era of big data, it is easy to imagine that we have all the information we need to make good decisions. But in fact the data we have are never complete, and may be only the tip of the iceberg. Just as much of the universe is composed of dark matter, invisible to us but nonetheless present, the universe of information is full of dark data that we overlook at our peril. In Dark Data, data expert David Hand takes us on a fa...
Level up your career by learning best practices for managing the data quality and integrity of your financial data Key Features Accelerate data integrity management using artificial intelligence-powered solutions Learn how business intelligence tools, ledger databases, and database locks solve data integrity issues Find out how to detect fraudulent transactions affecting financial report integrity Book DescriptionData integrity management plays a critical role in the success and effectiveness...
Pentaho Data Integration 4 Cookbook
by Adrian Sergio Pulvirenti and Maria Carina Roldan
This book has step-by-step instructions to solve data manipulation problems using PDI in the form of recipes. It has plenty of well-organized tips, screenshots, tables, and examples to aid quick and easy understanding. If you are a software developer or anyone involved or interested in developing ETL solutions, or in general, doing any kind of data manipulation, this book is for you. It does not cover PDI basics, SQL basics, or database concepts. You are expected to have a basic understanding of...
Learning Pentaho Data Integration 8 CE - Third Edition
by Maria Carina Roldan
Get up and running with the Pentaho Data Integration tool using this hands-on, easy-to-read guide About This Book • Manipulate your data by exploring, transforming, validating, and integrating it using Pentaho Data Integration 8 CE • A comprehensive guide exploring the features of Pentaho Data Integration 8 CE • Connect to any database engine, explore the databases, and perform all kind of operations on relational databases Who This Book Is For This book is a must-have for software developer...