Bruna Wundervald

Hamilton Institute &
Department of Statistics
Maynooth University

"Those who know, do. Those that understand, teach."

― Aristotle

Welcome!

I am a statistician, reproducible research enthusiast, R and git active user. I like statistical modelling in general, package and dashboards development, text mining and APIs. I am specially interested in machine learning, Bayesian inference, multivariate analysis, feature extraction and probabilistic graphical models.

What am I doing right now?

Currently, I am pursuing my PhD at the Hamilton Institute in Maynooth, Ireland. I am a part of Andrew Parnell’s research group, and we are mostly working with extensions (Bayesian) tree-based models and its many possible applications.

Apart from that, I am also very interested in Music Information Retrieval. I have developed two packages for music data extraction in R: vagalumeR, which is about getting lyrics data from the Vagalume API, and chorrrds, a package that extracts music chords from the CifraClub website (both available on CRAN). I have contributed to the R package that makes the connection with the Spotify API (found at this repository). More about music data extraction & analysis can be found at my R-Music blog, a blog that I started in 2018 and that has as its main goals to enable the study and practice of Music Information Retrieval (MIR) in R.

I am a member of the R-Ladies (Dublin and São Paulo chapters), a non-profit organization that promotes gender diversity in the R/data science community. I am also a moderator of various data science/programming communities. I am always happy to help people who are interested in the fantastic area of Data Science.

Apart from that, I like to spend my "free time" studying music (focusing on piano), languages and linguistics, cycling and traveling.

Expertises

Machine Learning
Bayesian Inference
Multivariate Analysis
Statistical Programming

Interests

Data manipulation and visualization
Music Informational Retrieval
Reproducible research
Languages & Culture

Education

Current PhD in Statistics & Machine Learning (2018 - )

Maynooth University
First-Class Honours (BSc) in Statistics (2013 - 2018)

Federal University of Paraná

Skills

	R
	Machine Learning
	Julia
	HTML/CSS & LaTeX
	Python
	C/C++
	SQL

Laboratório de Estatística e Geoinformação

Talks and Courses given

2021

DONE An introduction to the tidymodels package,
- Invited Talk
- Young-ISA Webinar,
- Ireland
- January, 2021
- Slides [EN]
DONE Cluster-based Quotas for Fairness Improvements in Music Recommendation Systems,
- Ph.D. Group Meeting,
- Online,
- Ireland
- March, 2021
- Slides [EN]

2020

DONE rstudio::conf 2020 Summary
- Short Talk
- Hamilton Institute,
- Ireland
- February, 2020
- Slides [EN]
DONE I'm a Data Scientist, Git me out of here!,
- Invited Talk
- R-Ladies Meetup,
- Dublin, Ireland
- February 20th, 2020
- Slides
DONE Introduction to Julia Programming
- Ph.D. Group Talk
- Hamilton Institute,
- Maynooth, Ireland
- May 29th, 2020
- Slides
DONE The tidyverse for Machine Learning
- R-Ladies Helsinki Meetup,
- Online,
- Helsinki
- June 17th, 2020
- Slides
DONE How can we forecast epidemic diseases?
- Data Science in the Covid era Hamilton Institute Talks,
- Online,
- October 19th, 2020
- Slides
DONE Feature Engineering for Genre Characterization in Brazilian Music
- 13th International Workshop on Machine Learning and Music,
- Online,
- September 18th, 2020
- Slides
DONE Mixture Cure Rate Models for the Analysis of Survival Times in the COVID-19 Scenario
- Hamilton Institute Students Seminar Series,
- Online,
- Maynooth
- November 17th, 2020
- Slides

2019

DONE Regularization in Random Forests,
- Seminar
- Hamilton Institute,
- Ireland
- November, 2019
- Slides

DONE The tidyverse for Machine Learning,
- Invited Talk
- satRday São Paulo, Insper,
- São Paulo, São Paulo, Brazil
- November 30, 2019
- Slides [PT-BR], Slides [EN]

DONE Bayesian Optimization + Regularization Paths in Random Forests,
- Presentation,
- Hamilton Institute,
- Ireland
- November 10, 2019
- Slides, GitHub

DONE Regularization Methods in Random Forests,
- Poster,
- MLSS,
- London
- July 16, 2019
- Poster, GitHub

DONE An introduction to Random Forests using R,
- Invited Talk,
- R-Ladies Meetup,
- Dublin
- June 24, 2019
- Slides, GitHub

DONE Probabilistic Graphical Models in R and python,
- Invited Talk,
- IV International Seminar on Statistics with R,
- Niterói, Brazil
- May 23, 2019
- Slides, GitHub

DONE Music Data Analysis in R - Shortcourse,
- Shortcourse,
- IV International Seminar on Statistics with R,
- Niterói, Brazil
- May 21, 2019
- Slides [PT-BR], Slides [EN]

DONE Regularization Methods in Random Forests,
- Presentation,
- 39th Conference on Applied Statistics in Ireland,
- Dundalk,
- May 16, 2019
- Slides, GitHub

DONE An Introduction to Bayesian Regression Trees,
- Seminar,
- LEG-UFPR,
- Curitiba, Paraná, Brazil
- March 28, 2019
- Slides, GitHub

DONE Chord Based Feature Engineering for Genre Classification in Popular Brazilian Music
- Talk,
- XV School of Regression Models,
- Goiânia, Brazil
- March 27, 2019
- Slides, GitHub

DONE Construction and implementation of multivariate dispersion models,
- Poster,
- XV School of Regression Models,
- Goiânia, Brazil
- March 25, 2019
- Slides, GitHub

DONE googleAnalyticsR & purrr,
- Invited Talk,
- R-Ladies São Paulo Meetup,
- São Paulo, Brazil
- March 21, 2019
- Slides, GitHub

2018 and before

DONE The steps of a Kaggle project,
- Presentation,
- Hamilton Institute,
- Ireland
- November 14, 2018
- Slides, GitHub

DONE Git & GitHub tutorial,
- Presentation,
- Hamilton Institute,
- Ireland
- October 22, 2018
- Slides, GitHub

DONE My personal presentation,
- Presentation,
- Hamilton Institute,
- Ireland
- October 1st, 2018
- Slides, GitHub

DONE Tutorial: basic R + dplyr,
- Invited Talk,
- R-Ladies São Paulo Meetup,
- São Paulo, Brazil
- August 13, 2018
- Slides, GitHub

DONE Tutorial: basic R + dplyr,
- Invited Talk,
- R-Ladies Curitiba Meetup,
- Curitiba, Brazil
- June 16, 2018
- Slides, GitHub

DONE Construction and Implementation of Multivariate Dispersion Models
- Presentation,
- 63rd Brazilian Meeting of the International Biometrics Society,
- Curitiba, Brazil
- May 23, 2018
- Slides

DONE Music Data Analysis in R,
- Presentation,
- 1st R-Day: National Meeting of R Users,
- Curitiba, Brazil
- May 21, 2018
- Slides, Site

DONE RMarkdown Course,
- Shortcourse,
- UFPR,
- Curitiba, Brazil
- August 15, 2017
- Slides, Slides[2]

Template by Bootstrapious. Ported to Hugo by DevCows