BigDataFr recommends: Sparse p-Adic Data Coding for Computationally Efficient and Effective Big Data Analytics
[…]We develop the theory and practical implementation of p-adic sparse coding of data. Rather than the standard, sparsifying criterion that uses the L0 pseudo-norm, we use the p-adic norm. We require that the hierarchy or tree be node-ranked, as is standard practice in agglomerative and other hierarchical clustering, but not necessarily with decision trees. In order to structure the data, all computational processing operations are direct reading of the data, or are bounded by a constant number of direct readings of the data, implying linear computational time. […]
Read Paper
By Fionn Murtagh
Source: arxiv.org