projets:plim:20152016:gr6
Différences
Ci-dessous, les différences entre deux révisions de la page.
Les deux révisions précédentesRévision précédenteProchaine révision | Révision précédente | ||
projets:plim:20152016:gr6 [2015/11/22 15:56] – [Data manipulation] chammam | projets:plim:20152016:gr6 [2015/11/24 17:34] (Version actuelle) – [Machine learning and clustering] boussarsar | ||
---|---|---|---|
Ligne 39: | Ligne 39: | ||
==== Machine learning and clustering ==== | ==== Machine learning and clustering ==== | ||
- | Based on trails information (distance and duration) the next step is to build a dataset, the database. K-Means is the supervised | + | Based on trails information (distance and duration) the next step is to build a dataset, the database. K-Means is the unsupervised |
The first step is to initialize the input which is a bi-dimensional array of distance and duration, choosing the appropriate distance measure mechanism and fixing the number of clusters. After that, the algorithm chooses randomly n centroids based on the number of clusters, which is called k, and calculates the distance between a given centroid and all the other points based on Euclidean distance. After initializing the clusters, the algorithm finds the nearest point in the centre of the cluster and call it centroid by calculating intra-cluster distance (distance between points in the same cluster). It also calculates the inter-cluster distance to identify which point belongs to which cluster (distance between the centroid of each cluster and points of other clusters). The system repeats these instructions until the stability of cluster centroids or reaching the iterations limit. | The first step is to initialize the input which is a bi-dimensional array of distance and duration, choosing the appropriate distance measure mechanism and fixing the number of clusters. After that, the algorithm chooses randomly n centroids based on the number of clusters, which is called k, and calculates the distance between a given centroid and all the other points based on Euclidean distance. After initializing the clusters, the algorithm finds the nearest point in the centre of the cluster and call it centroid by calculating intra-cluster distance (distance between points in the same cluster). It also calculates the inter-cluster distance to identify which point belongs to which cluster (distance between the centroid of each cluster and points of other clusters). The system repeats these instructions until the stability of cluster centroids or reaching the iterations limit. | ||
- | ==== Clustering | + | ==== User matching |
- | Clustering every " | ||
- | |||
- | ==== User matching ==== | ||
- | When a user searches for a hiking | + | The main goal of this application is to give the users the possibility to find people, trail buddies, based on their hiking |
+ | The comparison mechanism is based on two conditions; two trail buddies are similar, if the number of trails | ||
=== Developped SOFTWARE === | === Developped SOFTWARE === |
projets/plim/20152016/gr6.1448207804.txt.gz · Dernière modification : 2015/11/22 15:56 de chammam