projets:plim:20152016:gr6
Differences
Below are the differences between two revisions of the page.
projets:plim:20152016:gr6 [2015/11/22 11:22] – [User matching] chammam | projets:plim:20152016:gr6 [2015/11/24 17:34] (current version) – [Machine learning and clustering] boussarsar | ||
---|---|---|---|
Line 33: | Line 33: | ||
===== How it works ===== | ===== How it works ===== | ||
- | ==== Data gathering (acquisition) ==== | ||
- | |||
- | |||
- | * GPS data (latitude, longitude) => distance | ||
- | * time-stamps => duration | ||
- | | ||
==== Data manipulation ==== | ==== Data manipulation ==== | ||
- | Measuring | + | After collecting the raw sensor data, a set of latitude and longitude pairs called Geopoints, it is time to calculate |
+ | To do so, the application' | ||
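The revision text is cut off before it names the calculation, but the surrounding lines refer to distance (from GPS latitude and longitude) and duration (from time-stamps). The following is a minimal sketch of how both could be derived from a list of Geopoints. The haversine formula, the `(timestamp_seconds, latitude, longitude)` sample layout, and the names `haversine_km` and `trail_summary` are assumptions made for illustration, not the application's actual code.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius, used by the haversine formula


def haversine_km(a, b):
    """Great-circle distance in km between two (latitude, longitude) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def trail_summary(samples):
    """Return (distance_km, duration_s) for one trail.

    samples is a hypothetical list of (timestamp_seconds, latitude, longitude)
    tuples in recording order.
    """
    # Distance: sum of the segments between consecutive Geopoints.
    distance = sum(
        haversine_km(p[1:], q[1:]) for p, q in zip(samples, samples[1:])
    )
    # Duration: last time-stamp minus first time-stamp.
    duration = samples[-1][0] - samples[0][0] if samples else 0
    return distance, duration
```

Summing segment lengths follows the recorded path rather than the straight line between start and end, which is usually what a hiking distance should reflect.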
+ | ==== Machine learning and clustering ==== | ||
- | ==== Clustering ==== | + | The next step is to build a dataset (the database) from the trail information, namely distance and duration. K-Means, an unsupervised machine learning (clustering) algorithm, is implemented in this solution to group similar users together. There are three clusters: easyTrails, mediumTrails and hardTrails. | ||
+ | The first step is to initialize the input, a two-dimensional array of distance and duration, choose the distance measure, and fix the number of clusters, k. The algorithm then picks k centroids at random and calculates the Euclidean distance between each centroid and every other point. After initializing the clusters, the algorithm takes the point nearest the centre of each cluster as its new centroid, using the intra-cluster distance (the distance between points in the same cluster). It also calculates the inter-cluster distance (the distance between the centroid of each cluster and the points of other clusters) to identify which point belongs to which cluster. The system repeats these steps until the cluster centroids are stable or the iteration limit is reached. | ||
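The loop described above can be sketched as follows. This is the textbook K-Means (Lloyd's algorithm), in which each centroid moves to the mean of its members, so it differs slightly from the "nearest point" wording above and is not the project's actual implementation. The function names, the optional `init` parameter, and the rule that maps clusters to easyTrails, mediumTrails and hardTrails (sorting centroids by distance, then duration) are assumptions made for illustration.

```python
import math
import random


def kmeans(points, k, max_iter=100, init=None, seed=0):
    """Cluster (distance, duration) tuples; return (centroids, labels)."""
    rng = random.Random(seed)
    # Start from the given centroids, or from k randomly chosen points.
    centroids = list(init) if init is not None else rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid (Euclidean).
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(v) / len(members) for v in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # empty cluster: keep as is
        if new_centroids == centroids:  # centroids are stable: stop early
            break
        centroids = new_centroids
    return centroids, labels


def name_clusters(centroids):
    """Hypothetical naming rule: rank centroids by (distance, duration)."""
    names = ["easyTrails", "mediumTrails", "hardTrails"]
    order = sorted(range(len(centroids)), key=lambda c: centroids[c])
    return {c: names[rank] for rank, c in enumerate(order)}
```

Because distance and duration are on different scales, the larger-valued feature dominates the Euclidean distance unless the inputs are normalized first. Random initialization can also settle on a poor grouping, so running the algorithm from several seeds is a common precaution.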
- | Clustering every " | + | ==== User matching ==== |
- | ==== User matching ==== | ||
- | When a user searches for a hiking | + | The main goal of this application is to let users find other people, called trail buddies, based on their hiking |
+ | The comparison mechanism is based on two conditions: two trail buddies are similar if the number of trails | ||
=== Developed SOFTWARE === | === Developed SOFTWARE === |
projets/plim/20152016/gr6.1448191324.txt.gz · Last modified: 2015/11/22 11:22 by chammam