Forum analysis

Context and problematic

Medical forums are full of important information, but it remains very complicated to extract and analyze.
Through these forums, one can genuinely understand the “utility of the patent”.

Goals

Analyze the content of medical forums, to highlight trends allowing a better understanding of patient issues and thus initiate concrete actions.

Our intervention

Setting up a data collection system :

  • Scrapping the Sjogrensworld.org forum
  • Inserting data into a MongoDB database

Data Modeling :

  • Descriptive analysis in order to have an overview of the collected data (Number of posts, engagement rate, link between posts, link between users etc…)
  • Discovery of patient-specific themes via topic modeling coupled with sentiment analysis to understand their impacts and thus be able to detect and dissect current and past trends.
  • Extracting latent knowledge from the data via word embedding techniques to detect certain patient issues as early as possible.

Results

  • Scraping 216,000 posts from the sjogrensworld.org forum
  • Creating a MongoDB database
  • Descriptive analysis of unstructured data
  • Discovering themes and relationships specific to patient data
  • Technical environment

    Python
    Scrapy
    MongoDB
    PyTorch
    NLTK
    Spacy
    Sklearn

    Together with our customers, we build solutions that change and facilitate their daily lives.

    Aide à la création de médicaments

    Plateforme d’analyse de besoins clients

    Conception et industrialisation du SI analytics

    Prédiction de retards

    Analyse de visage pour recommandation produits

    Application d’optimisation de la Supply Chain

    Scoring et analyse
    de la peau

    Analyse de Forums

    Personnalisation de contenu

    Analyse des activités de support IT

    Détection de tendances sur les réseaux sociaux

    Détection
    de beaconing

    Outil de classification de documents

    Détection de cancer via Deep Learning

    Conception de plateforme de veille stratégique

    Rendements
    des champs agricoles

    Conception du Data Hub et implémentation

    Analyse et prévention des problèmes Skype

    Assistant d’aide à la recherche

    Classification de pages Web