ABSTRACT
Federated learning (FL), which enables collaborative learning without revealing raw data, is an emerging topic in privacy-preserving machine learning. Based on our experiences in thousands of real-world applications, time-series feature extraction plays a significant role in improving model quality. In this work, we propose a system automatically integrating time series feature extraction for training FL models. Our experiments show that by adopting time series feature extraction, the model accuracy (AUC) is improved by 3% on average, and recall is increased by 10% in recommender systems. We have open-sourced the project https://github.com/4paradigm/tsfe and provided a step by step demonstration on how audiences can use our system to create their own FL pipeline that extracts time series features. Demonstration video at: https://youtu.be/UW27dWT-ays
Supplemental Material
- Abbas Acar, Hidayet Aksu, A. Selcuk Uluagac, and Mauro Conti. 2018. A Survey on Homomorphic Encryption Schemes: Theory and Implementation. ACM Comput. Surv., Vol. 51, 4, Article 79 (July 2018), 35 pages. https://doi.org/10.1145/3214303Google Scholar
Digital Library
- Cheng Chen, Jun Yang, Mian Lu, Taize Wang, Zhao Zheng, Yuqiang Chen, Wenyuan Dai, Bingsheng He, Weng-Fai Wong, Guoan Wu, Yuping Zhao, and Andy Rudoff. 2021. Optimizing In-Memory Database Engine for AI-Powered on-Line Decision Augmentation Using Persistent Memory. Proc. VLDB Endow., Vol. 14, 5 (Jan. 2021), 799--812. https://doi.org/10.14778/3446095.3446102Google Scholar
Digital Library
- K. Cheng, T. Fan, Y. Jin, Y. Liu, T. Chen, D. Papadopoulos, and Q. Yang. 2021. SecureBoost: A Lossless Federated Learning Framework. IEEE Intelligent Systems 01 (May 2021), 1--1. https://doi.org/10.1109/MIS.2021.3082561Google Scholar
Digital Library
- FATE. 2021. Federated AI Technology Enabler. https://github.com/FederatedAI/FATE. Accessed: 2022-06--10.Google Scholar
- GDPR. 2021. General Data Protection Regulation. https://gdpr-info.eu/. Accessed: 2022-06--10.Google Scholar
- Google. 2021. Classification: ROC Curve and AUC. https://developers.google.com/machine-learning/crash-course/classification/roc-and-auc. Accessed: 2022-06--10.Google Scholar
- Stephen Hardy, Wilko Henecka, Hamish Ivey-Law, Richard Nock, Giorgio Patrini, Guillaume Smith, and Brian Thorne. 2017. Private federated learning on vertically partitioned data via entity resolution and additively homomorphic encryption. arXiv preprint arXiv:1711.10677 (2017).Google Scholar
- JD. 2021. JD Prediction of Purchasing Intention of High Potential Users. https://github.com/4paradigm/tsfe/blob/main/DATASET.md. Accessed: 2022-06--10.Google Scholar
- Peter Kairouz, H Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, et al. 2021. Advances and open problems in federated learning. Foundations and Trends® in Machine Learning, Vol. 14, 1--2 (2021), 1--210.Google Scholar
- Qinbin Li, Zeyi Wen, Zhaomin Wu, Sixu Hu, Naibo Wang, Yuan Li, Xu Liu, and Bingsheng He. 2021. A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection. arxiv: 1907.09693 [cs.LG]Google Scholar
- Tian Li, Anit Kumar Sahu, Ameet Talwalkar, and Virginia Smith. 2020. Federated learning: Challenges, methods, and future directions. IEEE Signal Processing Magazine, Vol. 37, 3 (2020), 50--60.Google Scholar
Cross Ref
- Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Aguera y Arcas. 2017. Communication-efficient learning of deep networks from decentralized data. In Artificial intelligence and statistics. PMLR, 1273--1282.Google Scholar
- OpenMLDB. 2021. An Open Source Database for Machine Learning Systems. https://github.com/4paradigm/OpenMLDB. Accessed: 2022-06--10.Google Scholar
- Pascal Paillier. 1999. Public-Key Cryptosystems Based on Composite Degree Residuosity Classes. In Advances in Cryptology -- EUROCRYPT '99, Jacques Stern (Ed.). Springer Berlin Heidelberg, Berlin, Heidelberg, 223--238.Google Scholar
- PDPA. 2021. Personal Data Protection Act. https://www.pdpc.gov.sg/Overview-of-PDPA/The-Legislation/Personal-Data-Protection-Act. Accessed: 2022-06--10.Google Scholar
- Qiang Yang, Yang Liu, Tianjian Chen, and Yongxin Tong. 2019. Federated Machine Learning: Concept and Applications. ACM Trans. Intell. Syst. Technol., Vol. 10, 2, Article 12 (Jan. 2019), 19 pages. https://doi.org/10.1145/3298981Google Scholar
Digital Library
Index Terms
A System for Time Series Feature Extraction in Federated Learning
Recommendations
Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh – A Python package)
AbstractTime series feature engineering is a time-consuming process because scientists and engineers have to consider the multifarious algorithms of signal processing and time series analysis for identifying and extracting meaningful features ...
Orthogonal feature learning for time series clustering
This paper presents a new method that uses orthogonalized features for time series clustering and classification. To cluster or classify time series data, either original data or features extracted from the data are used as input for various clustering ...
A federated learning system with enhanced feature extraction for human activity recognition
AbstractWith the rapid growth of mobile devices, wearable sensor-based human activity recognition (HAR) has become one of the hottest topics in the Internet of Things. However, it is challenging for traditional approaches to achieving high ...






Comments