Skip to content

CoP: Data Science: SEIE Survey Analysis #26

@ExperimentsInHonesty

Description

@ExperimentsInHonesty

Overview

We need to report on our progress for our partner at the Department of Neighborhood Empowerment

Action Items

Phase 1--Completed by Sathwik:

  • NLP engineering/analysis on a large survey featuring free text columns.
  • Run TF-IDF analysis
  • create word clouds
  • Create a presentation to show data to DONE and neighborhood council members. Tooling could include scikit-learn, spacy, pandas. First version here

Phase 2--Picked up by Henry (8.2.2021):

  • Create filtering system on google sheets (labels)
  • Revisit preprocess script and apply to dataset (ex. remove stopwords, puncuation, duplicates, etc)
  • Frequency counts on most common unigrams, bigrams and trigrams for One Question
    - [x] On the entire dataset
    - [x] By Region
  • Create clear view of responses containing key phrases
  • Number of council members who mention the "top" topics
  • Present to Julien

Phase 3

  • Expand to the rest of the questionnaire
  • Create dashboard with results of quantitative analysis, showing comparisons of themes topic by region, and comparisons of regions grouped by topic. (Dashboard link)
  • Develop presentation/write-up for final delivery to Julien

Resources/Instructions

Currently Underway:
@henrykaplan

Metadata

Metadata

Assignees

Type

No type

Projects

Status

Done

Status

Filled

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions