Amazon cover image
Image from Amazon.com

Feature engineering for machine learning and data analytics

Contributor(s): Material type: TextTextSeries: Chapman and Hall/CRC data mining and knowledge discovery series; no. 44Publication details: Boca Raton Chapman and Hall/CRC Press 2018Description: xviii, 400 pISBN:
  • 9781138744387
Subject(s): DDC classification:
  • 006.31 F3
Summary: Feature engineering plays a vital role in big data analytics. Machine learning and data mining algorithms cannot work without data. Little can be achieved if there are few features to represent the underlying data objects, and the quality of results of those algorithms largely depends on the quality of the available features. Feature Engineering for Machine Learning and Data Analytics provides a comprehensive introduction to feature engineering, including feature generation, feature extraction, feature transformation, feature selection, and feature analysis and evaluation. The book presents key concepts, methods, examples, and applications, as well as chapters on feature engineering for major data types such as texts, images, sequences, time series, graphs, streaming data, software engineering data, Twitter data, and social media data. It also contains generic feature generation approaches, as well as methods for generating tried-and-tested, hand-crafted, domain-specific features. The first chapter defines the concepts of features and feature engineering, offers an overview of the book, and provides pointers to topics not covered in this book. The next six chapters are devoted to feature engineering, including feature generation for specific data types. The subsequent four chapters cover generic approaches for feature engineering, namely feature selection, feature transformation based feature engineering, deep learning based feature engineering, and pattern based feature generation and engineering. The last three chapters discuss feature engineering for social bot detection, software management, and Twitter-based applications respectively. This book can be used as a reference for data analysts, big data scientists, data preprocessing workers, project managers, project developers, prediction modelers, professors, researchers, graduate students, and upper level undergraduate students. It can also be used as the primary text for courses on feature engineering, or as a supplement for courses on machine learning, data mining, and big data analytics.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)

Table of Contents 1. Preliminaries and Overview Guozhu Dong and Huan Liu Preliminaries Overview of the Chapters Beyond this Book 2 Feature Engineering for Text Data Chase Geigle, Qiaozhu Mei, and ChengXiang Zhai Overview of Text Representation Text as Strings Sequence of Words Representation Bag of Words Representation Structural Representation of Text Latent Semantic Representation Explicit Semantic Representation Embeddings for Text Representation Context-Sensitive Text Representation   3 Feature Extraction and Learning for Visual Data Parag S. Chandakkar, Ragav Venkatesan, and Baoxin Li Classical Visual Feature Representations Latent-feature Extraction Deep Image Features 4 Feature-based time-series analysis Ben D. Fulcher Feature-based representations of time series Global features Subsequence features Combining time-series representations Feature-based forecasting 5 Feature Engineering for Data Streams Yao Ma, Jiliang Tang, and Charu Aggarwal Streaming Settings Linear Methods for Streaming Feature Construction Non-linear Methods for Streaming Feature Construction Feature Selection for Data Streams with Streaming Feature Feature Selection for Data Streams with Streaming Instances Discussions and Challenges 6 Feature Generation and Feature Engineering for Sequences Guozhu Dong, Lei Duan, Jyrki Nummenmaa, and Peng Zhang Basics on Sequence Data and Sequence Patterns Approaches to Using Patterns in Sequence Features Traditional Pattern-Based Sequence Features Mined Sequence Patterns for Use in Sequence Features Sequence Features Not De_ned by Patterns Sequence Databases 7 Feature Generation for Graphs and Networks Yuan Yao, Hanghang Tong, Feng Xu, and Jian Lu Feature Types Feature Generation . Feature Usages Future Directions 8 Feature Selection and Evaluation Yun Li and Tao Li Feature Selection Frameworks Advanced Topics for Feature Selection Future Work and Conclusion 9 Automating Feature Engineering in Supervised Learning Udayan Khurana A Few Simple Approaches Hierarchical Exploration of Feature Transformations Learning Optimal Traversal Policy Finding E_ective Features without Model Training Miscellenious 10 Pattern based Feature Generation Yunzhe Jia, James Bailey, Ramamohanarao Kotagiri, and Christopher Leckie Preliminaries Framework of pattern based feature generation Pattern mining algorithms Pattern selection approaches . Pattern based feature generation Pattern based feature generation for classi_cation Pattern based feature generation for clustering 11 Deep Learning for Feature Representation Suhang Wang and Huan Liu Restricted Boltzmann Machine AutoEncoder Convolutional Neural Networks Word Embedding and Recurrent Neural Networks . Generative Adversarial Networks and Variational Autoencoder Discussion and Further Readings 12 Feature Engineering for Social Bot Detection Onur Varol, Clayton A. Davis, Filippo Menczer, and Alessandro Flammini Social bot detection . Online bot detection framework 13 Feature Generation and Engineering for Software Analytics Xin Xia and David Lo Features for Defect Prediction Features for Crash Release Prediction for Apps Features from Mining Monthly Reports to Predict Developer Turnover 14 Feature Engineering for Twitter-based Applications Sanjaya Wijeratne, Amit Sheth, Shrenyansh Bhatt, Lakshika Balasuriya, Hussein S. Al-Olimat, Manas Gaur, Amir Hossein Yazdavar, Krishnaprasad Thirunarayan

Feature engineering plays a vital role in big data analytics. Machine learning and data mining algorithms cannot work without data. Little can be achieved if there are few features to represent the underlying data objects, and the quality of results of those algorithms largely depends on the quality of the available features. Feature Engineering for Machine Learning and Data Analytics provides a comprehensive introduction to feature engineering, including feature generation, feature extraction, feature transformation, feature selection, and feature analysis and evaluation. The book presents key concepts, methods, examples, and applications, as well as chapters on feature engineering for major data types such as texts, images, sequences, time series, graphs, streaming data, software engineering data, Twitter data, and social media data. It also contains generic feature generation approaches, as well as methods for generating tried-and-tested, hand-crafted, domain-specific features. The first chapter defines the concepts of features and feature engineering, offers an overview of the book, and provides pointers to topics not covered in this book. The next six chapters are devoted to feature engineering, including feature generation for specific data types. The subsequent four chapters cover generic approaches for feature engineering, namely feature selection, feature transformation based feature engineering, deep learning based feature engineering, and pattern based feature generation and engineering. The last three chapters discuss feature engineering for social bot detection, software management, and Twitter-based applications respectively. This book can be used as a reference for data analysts, big data scientists, data preprocessing workers, project managers, project developers, prediction modelers, professors, researchers, graduate students, and upper level undergraduate students. It can also be used as the primary text for courses on feature engineering, or as a supplement for courses on machine learning, data mining, and big data analytics.

There are no comments on this title.

to post a comment.

Powered by Koha