-
Notifications
You must be signed in to change notification settings - Fork 107
Description
Requirement
The sentences.csv file has very limited data which can be used for the initial training. The aim is to gather more data via crowd-sourcing and sources to help improve the responses of the bot via ML models.
Pre-requisite
Elementary knowledge of Python
Elementary understanding of the available data
Dependencies
None
Description
This is an open-ended issue where participants can explore crowd-sourcing to gather the data required for improving the bot's NLP capabilities. We can either look at using a crowd-sourcing platform (like Amazon Mechanical Turks) or a simple survey form distributed amongst friends.
The primary aim with this bit would be to get a wide variety of questions that people may ask a mapbot i.e. a bot which can answer direction and location information related queries primarily. Please provide the details of the different APIs we're planning to include in the bot and ask folks to frame their questions based on the set of available capabilites.
As discussed in a similar issue #52, elementary pre-processing of the data might be required before we put it in the db. Please look at sentences.csv to get an idea of the kind of questions we're handling right now.
Please review your method of gathering data before actually putting it up on a site or sharing it with your friends/batchmates/colleagues