[arXiv] BigDataFr recommends: Measuring Economic Resilience to Natural Disasters with Big Economic Transaction Data

BigDataFr recommends: Measuring Economic Resilience to Natural Disasters with Big Economic Transaction Data Subjects: Databases (cs.DB) […] This research explores the potential to analyze bank card payments and ATM cash withdrawals in order to map and quantify how people are impacted by and recover from natural disasters. Our approach defines a disaster-affected community’s economic recovery time […]

[arXiv] BigDataFr recommends : Big Data analytics. Three use cases with R, Python and #Spark #datascientist

BigDataFr recommends: Big Data analytics. Three use cases with R, Python and Spark Subjects: Applications (stat.AP); Learning (cs.LG) […] Management and analysis of big data are systematically associated with a data distributed architecture in the Hadoop and now Spark frameworks. This article offers an introduction for statisticians to these technologies by comparing the performance obtained […]

[arXiv] BigDataFr recommends: The Scalable Langevin Exact Algorithm: Bayesian Inference for Big Data

BigDataFr recommends: The Scalable Langevin Exact Algorithm: Bayesian Inference for Big Data Subjects: Methodology (stat.ME); Computation (stat.CO) […] This paper introduces a class of Monte Carlo algorithms which are based upon simulating a Markov process whose quasi-stationary distribution coincides with the distribution of interest. This differs fundamentally from, say, current Markov chain Monte Carlo in […]

[arXiv] BigDataFr recommends: Human-Algorithm Interaction Biases in the Big Data Cycle: A Markov Chain Iterated Learning Framework

BigDataFr recommends: Human-Algorithm Interaction Biases in the Big Data Cycle: A Markov Chain Iterated Learning Framework Comments: This research was supported by National Science Foundation grant NSF-1549981 Subjects: Learning (cs.LG); Human-Computer Interaction (cs.HC) […] Early supervised machine learning algorithms have relied on reliable expert labels to build predictive models. However, the gates of data generation […]

[arXiv] BigDataFr recommends: Clustering Mixed Datasets Using Homogeneity Analysis with Applications to Big Data

BigDataFr recommends: Clustering Mixed Datasets Using Homogeneity Analysis with Applications to Big Data Subjects: Machine Learning (stat.ML) […] Clustering datasets with a mix of continuous and categorical attributes is encountered routinely by data analysts. This work presents a method to clustering such datasets using Homogeneity Analysis. An Optimal Euclidean representation of mixed datasets is obtained […]

[arXiv] BigDataFr recommends: Asynchronous Parallel Algorithms for Nonconvex Big-Data Optimization

BigDataFr recommends: Asynchronous Parallel Algorithms for Nonconvex Big-Data Optimization: Model and Convergence Subjects: Optimization and Control (math.OC); Distributed, Parallel, and Cluster Computing (cs.DC) […] We propose a novel asynchronous parallel algorithmic framework for the minimization of the sum of a smooth nonconvex function and a convex nonsmooth regularizer, subject to both convex and nonconvex constraints. […]

[Datasciencecentral] BigDataFr recommends: A methodology for solving problems with DataScience for Internet of Things – Full – #iot

BigDataFr recommends: A methodology for solving problems with DataScience for Internet of Things – Part 1 and 2 […] This two part blog is based on my forthcoming book: Data Science for Internet of Things. It is also the basis for the course I teach Data Science for Internet of Things Course. I will be […]