pingouin.rm_corr#

pingouin.rm_corr(data=None, x=None, y=None, subject=None)[source]#

Repeated measures correlation.

Parameters:
datapandas.DataFrame

Dataframe.

x, ystring

Name of columns in data containing the two dependent variables.

subjectstring

Name of column in data containing the subject indicator.

Returns:
statspandas.DataFrame
  • 'r': Repeated measures correlation coefficient

  • 'dof': Degrees of freedom

  • 'pval': p-value

  • 'CI95': 95% parametric confidence intervals

  • 'power': achieved power of the test (= 1 - type II error).

See also

plot_rm_corr

Notes

Repeated measures correlation (rmcorr) is a statistical technique for determining the common within-individual association for paired measures assessed on two or more occasions for multiple individuals.

From Bakdash and Marusich (2017):

Rmcorr accounts for non-independence among observations using analysis of covariance (ANCOVA) to statistically adjust for inter-individual variability. By removing measured variance between-participants, rmcorr provides the best linear fit for each participant using parallel regression lines (the same slope) with varying intercepts. Like a Pearson correlation coefficient, the rmcorr coefficient is bounded by − 1 to 1 and represents the strength of the linear association between two variables.

Results have been tested against the rmcorr R package.

Missing values are automatically removed from the dataframe (listwise deletion).

Examples

>>> import pingouin as pg
>>> df = pg.read_dataset('rm_corr')
>>> pg.rm_corr(data=df, x='pH', y='PacO2', subject='Subject')
               r  dof      pval           CI95%     power
rm_corr -0.50677   38  0.000847  [-0.71, -0.23]  0.929579

Now plot using the pingouin.plot_rm_corr() function:

>>> import pingouin as pg
>>> df = pg.read_dataset('rm_corr')
>>> g = pg.plot_rm_corr(data=df, x='pH', y='PacO2', subject='Subject')
../_images/pingouin-rm_corr-1.png