Skip to content

Gather more data for the chatbot's database via crowd-sourcing #53

@janakrajchadha

Description

@janakrajchadha

Requirement
The sentences.csv file has very limited data which can be used for the initial training. The aim is to gather more data via crowd-sourcing and sources to help improve the responses of the bot via ML models.

Pre-requisite

Elementary knowledge of Python
Elementary understanding of the available data

Dependencies
None

Description
This is an open-ended issue where participants can explore crowd-sourcing to gather the data required for improving the bot's NLP capabilities. We can either look at using a crowd-sourcing platform (like Amazon Mechanical Turks) or a simple survey form distributed amongst friends.

The primary aim with this bit would be to get a wide variety of questions that people may ask a mapbot i.e. a bot which can answer direction and location information related queries primarily. Please provide the details of the different APIs we're planning to include in the bot and ask folks to frame their questions based on the set of available capabilites.

As discussed in a similar issue #52, elementary pre-processing of the data might be required before we put it in the db. Please look at sentences.csv to get an idea of the kind of questions we're handling right now.

Please review your method of gathering data before actually putting it up on a site or sharing it with your friends/batchmates/colleagues

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementgssoc20Issues to be picked up by participants during GSSoC 2020mediumMedium level issue GSSoC 2020

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions