Bruna Wundervald


Hamilton Institute &
Department of Statistics
Maynooth University
"Those who know, do. Those that understand, teach."
― Aristotle

Welcome!

I am a statistician, reproducible research enthusiast, R and git active user. I like statistical modelling in general, package and dashboards development, text mining and APIs. I am specially interested in machine learning, Bayesian inference, multivariate analysis, feature extraction and probabilistic graphical models.

What am I doing right now?

Currently, I am pursuing my PhD at the Hamilton Institute in Maynooth, Ireland. I am a part of Andrew Parnell’s research group, and we are mostly working with extensions (Bayesian) tree-based models and its many possible applications.

Apart from that, I am also very interested in Music Information Retrieval. I have developed two packages for music data extraction in R: vagalumeR, which is about getting lyrics data from the Vagalume API, and chorrrds, a package that extracts music chords from the CifraClub website (both available on CRAN). I have contributed to the R package that makes the connection with the Spotify API (found at this repository). More about music data extraction & analysis can be found at my R-Music blog, a blog that I started in 2018 and that has as its main goals to enable the study and practice of Music Information Retrieval (MIR) in R.

I am a member of the R-Ladies (Dublin and São Paulo chapters), a non-profit organization that promotes gender diversity in the R/data science community. I am also a moderator of various data science/programming communities. I am always happy to help people who are interested in the fantastic area of Data Science.

Apart from that, I like to spend my "free time" studying music (focusing on piano), languages and linguistics, cycling and traveling.

Expertises

  • Machine Learning
  • Bayesian Inference
  • Multivariate Analysis
  • Statistical Programming

Interests

  • Data manipulation and visualization
  • Music Informational Retrieval
  • Reproducible research
  • Languages & Culture

Education

  • Current PhD in Statistics & Machine Learning (2018 - )

    Maynooth University

  • First-Class Honours (BSc) in Statistics (2013 - 2018)

    Federal University of Paraná

Skills

R
Machine Learning
Julia
HTML/CSS & LaTeX
Python
C/C++
SQL

Talks and Courses given

2021

  1. DONE An introduction to the tidymodels package,
    • Invited Talk
    • Young-ISA Webinar,
    • Ireland
    • January, 2021
    • Slides [EN]
  2. DONE Cluster-based Quotas for Fairness Improvements in Music Recommendation Systems,
    • Ph.D. Group Meeting,
    • Online,
    • Ireland
    • March, 2021
    • Slides [EN]

2020

  1. DONE rstudio::conf 2020 Summary
    • Short Talk
    • Hamilton Institute,
    • Ireland
    • February, 2020
    • Slides [EN]
  2. DONE I'm a Data Scientist, Git me out of here!,
    • Invited Talk
    • R-Ladies Meetup,
    • Dublin, Ireland
    • February 20th, 2020
    • Slides
  3. DONE Introduction to Julia Programming
    • Ph.D. Group Talk
    • Hamilton Institute,
    • Maynooth, Ireland
    • May 29th, 2020
    • Slides
  4. DONE The tidyverse for Machine Learning
    • R-Ladies Helsinki Meetup,
    • Online,
    • Helsinki
    • June 17th, 2020
    • Slides
  5. DONE How can we forecast epidemic diseases?
    • Data Science in the Covid era Hamilton Institute Talks,
    • Online,
    • October 19th, 2020
    • Slides
  6. DONE Feature Engineering for Genre Characterization in Brazilian Music
    • 13th International Workshop on Machine Learning and Music,
    • Online,
    • September 18th, 2020
    • Slides
  7. DONE Mixture Cure Rate Models for the Analysis of Survival Times in the COVID-19 Scenario
    • Hamilton Institute Students Seminar Series,
    • Online,
    • Maynooth
    • November 17th, 2020
    • Slides

2019

  1. DONE Regularization in Random Forests,
    • Seminar
    • Hamilton Institute,
    • Ireland
    • November, 2019
    • Slides
  2. DONE The tidyverse for Machine Learning,
  3. DONE Bayesian Optimization + Regularization Paths in Random Forests,
    • Presentation,
    • Hamilton Institute,
    • Ireland
    • November 10, 2019
    • Slides, GitHub
  4. DONE Regularization Methods in Random Forests,
  5. DONE An introduction to Random Forests using R,
    • Invited Talk,
    • R-Ladies Meetup,
    • Dublin
    • June 24, 2019
    • Slides, GitHub
  6. DONE Probabilistic Graphical Models in R and python,
    • Invited Talk,
    • IV International Seminar on Statistics with R,
    • Niterói, Brazil
    • May 23, 2019
    • Slides, GitHub
  7. DONE Music Data Analysis in R - Shortcourse,
  8. DONE Regularization Methods in Random Forests,
    • Presentation,
    • 39th Conference on Applied Statistics in Ireland,
    • Dundalk,
    • May 16, 2019
    • Slides, GitHub
  9. DONE An Introduction to Bayesian Regression Trees,
    • Seminar,
    • LEG-UFPR,
    • Curitiba, Paraná, Brazil
    • March 28, 2019
    • Slides, GitHub
  10. DONE Chord Based Feature Engineering for Genre Classification in Popular Brazilian Music
    • Talk,
    • XV School of Regression Models,
    • Goiânia, Brazil
    • March 27, 2019
    • Slides, GitHub
  11. DONE Construction and implementation of multivariate dispersion models,
    • Poster,
    • XV School of Regression Models,
    • Goiânia, Brazil
    • March 25, 2019
    • Slides, GitHub
  12. DONE googleAnalyticsR & purrr,
    • Invited Talk,
    • R-Ladies São Paulo Meetup,
    • São Paulo, Brazil
    • March 21, 2019
    • Slides, GitHub

2018 and before

  1. DONE The steps of a Kaggle project,
    • Presentation,
    • Hamilton Institute,
    • Ireland
    • November 14, 2018
    • Slides, GitHub
  2. DONE Git & GitHub tutorial,
    • Presentation,
    • Hamilton Institute,
    • Ireland
    • October 22, 2018
    • Slides, GitHub
  3. DONE My personal presentation,
    • Presentation,
    • Hamilton Institute,
    • Ireland
    • October 1st, 2018
    • Slides, GitHub
  4. DONE Tutorial: basic R + dplyr,
    • Invited Talk,
    • R-Ladies São Paulo Meetup,
    • São Paulo, Brazil
    • August 13, 2018
    • Slides, GitHub
  5. DONE Tutorial: basic R + dplyr,
    • Invited Talk,
    • R-Ladies Curitiba Meetup,
    • Curitiba, Brazil
    • June 16, 2018
    • Slides, GitHub
  6. DONE Construction and Implementation of Multivariate Dispersion Models
    • Presentation,
    • 63rd Brazilian Meeting of the International Biometrics Society,
    • Curitiba, Brazil
    • May 23, 2018
    • Slides
  7. DONE Music Data Analysis in R,
    • Presentation,
    • 1st R-Day: National Meeting of R Users,
    • Curitiba, Brazil
    • May 21, 2018
    • Slides, Site
  8. DONE RMarkdown Course,