| Version: | 1.3-0 | 
| Date: | 2023-11-17 | 
| Title: | Frequency Distribution Tables, Histograms and Polygons | 
| Author: | J. C. Faria [aut, cre], I. B. Allaman [aut], E. G. Jelihovschi [aut] | 
| Maintainer: | J. C. Faria <joseclaudio.faria@gmail.com> | 
| Depends: | R (≥ 2.6.0) | 
| Imports: | xtable | 
| Description: | Perform frequency distribution tables, associated histograms and polygons from vector, data.frame and matrix objects for numerical and categorical variables. | 
| License: | GPL-2 | GPL-3 [expanded from: GPL (≥ 2)] | 
| URL: | https://github.com/jcfaria/fdth | 
| Encoding: | UTF-8 | 
| NeedsCompilation: | no | 
| Packaged: | 2023-11-17 22:02:23 UTC; jcfaria | 
| Repository: | CRAN | 
| Date/Publication: | 2023-11-17 23:00:02 UTC | 
Frequency distribution tables, histograms and polygons
Description
The fdth package contains a set of functions which easily allows
the user to make frequency distribution tables (‘fdt’), its associated
histograms and frequency polygons (absolute, relative and cumulative).
The ‘fdt’ can be formatted in many ways which may be suited to
publication in many different ways (papers, books, etc).
The plot method (S3) is the histogram which can be dealt with the
easiness and flexibility of a high level function.
Details
The frequency of a particular observation is the number of times the observation occurs in the data. The distribution of a variable is the pattern of frequencies of the observation.
Frequency distribution table ‘fdt’ can be used for ordinal, continuous and categorical variables.
The R environment provides a set of functions (generally low level)
enabling the user to perform a ‘fdt’ and the associated graphical representation,
the histogram. A ‘fdt’ plays an important role to summarize data information and 
is the basis for the estimation of probability density function used in 
parametrical inference.
However, for novices or ocasional users of R, it can be laborious to
find out all necessary functions and graphical parameters to do a normalized 
and pretty ‘fdt’ and the associated histogram ready for publications.
That is the aim of this package, i.e, to allow the user easily and flexibly to do
both: the ‘fdt’ and the histogram. The most common input data for univariated is
a vector. For multivariated data can be used both: a data.frame,
in this case also allowing grouping all numerical variables according to one
categorical, or matrices.
The simplest way to run ‘fdt’ and ‘fdt_cat’ is by supplying only the ‘x’
object, for example: d <- fdt(x). In this case all necessary
default values (‘breaks’ and ‘right’) ("Sturges" and FALSE
respectively) will be used, if the ‘x’ object is categorical then just use
d <- fdt_cat(x).
If the varable is of contiuos type, you can also supply:
-  ‘x’ and ‘k’ (number of class intervals); 
-  ‘x’, ‘start’ (left endpoint of the first class interval) and ‘end’ (right endpoint of the last class interval); or 
-  ‘x’, ‘start’, ‘end’ and ‘h’ (class interval width). 
These options make the ‘fdt’ very easy and flexible.
The ‘fdt’ and ‘fdt_cat’ object store information to be used by methods summary,
print and plot. The result of plot is a histogram or
polygon (absolute, relative or cumulative).
The methods summary, print and plot provide a reasonable
set of parameters to format and plot the ‘fdt’ object in a pretty
(and publishable) way.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
hist provided by graphics and
table, cut both provided by base.
Examples
library (fdth)
# Numerical
#======================
# Vectors: univariated
#======================
x <- rnorm(n=1e3,
           mean=5,
           sd=1)
(ft <- fdt(x))
# Histograms
plot(ft)  # Absolute frequency histogram
plot(ft,
     main='My title')
plot(ft,
     x.round=3,
     col='darkgreen')
plot(ft,
     xlas=2)
plot(ft,
     x.round=3,
     xlas=2,
     xlab=NULL)
plot(ft,
     v=TRUE,
     cex=.8,
     x.round=3,
     xlas=2,
     xlab=NULL,
     col=rainbow(11))
plot(ft,
     type='fh')    # Absolute frequency histogram
plot(ft,
     type='rfh')   # Relative frequency histogram
plot(ft,
     type='rfph')  # Relative frequency (%) histogram
plot(ft,
     type='cdh')   # Cumulative density histogram
plot(ft,
     type='cfh')   # Cumulative frequency histogram
plot(ft,
     type='cfph')  # Cumulative frequency (%) histogram
# Polygons
plot(ft,
     type='fp')    # Absolute frequency polygon
plot(ft,
     type='rfp')   # Relative frequency polygon
plot(ft,
     type='rfpp')  # Relative frequency (%) polygon
plot(ft,
     type='cdp')   # Cumulative density polygon
plot(ft,
     type='cfp')   # Cumulative frequency polygon
plot(ft,
     type='cfpp')  # Cumulative frequency (%) polygon
# Density
plot(ft,
     type='d')     # Density
# Summary
ft
summary(ft)  # the same
print(ft)    # the same
show(ft)     # the same
summary(ft,
        format=TRUE)      # It can not be what you want to publications!
summary(ft,
        format=TRUE,
        pattern='%.2f')   # Huumm ..., good, but ... Can it be better?
summary(ft,
        col=c(1:2, 4, 6),
        format=TRUE,
        pattern='%.2f')   # Yes, it can!
range(x)                  # To know x
summary(fdt(x,
            start=1, 
            end=9,
            h=1),
        col=c(1:2, 4, 6),
        format=TRUE,
        pattern='%d')     # Is it nice now?
# The fdt.object
ft[['table']]                        # Stores the feq. dist. table (fdt)
ft[['breaks']]                       # Stores the breaks of fdt
ft[['breaks']]['start']              # Stores the left value of the first class
ft[['breaks']]['end']                # Stores the right value of the last class
ft[['breaks']]['h']                  # Stores the class interval
as.logical(ft[['breaks']]['right'])  # Stores the right option
# Theoretical curve and fdt
y <- rnorm(1e5,
           mean=5, 
           sd=1)
ft <- fdt(y,
          k=100)
plot(ft,
     type='d',                       # density
     col=heat.colors(100))
curve(dnorm(x,
            mean=5, 
            sd=1),
      n=1e3,      
      add=TRUE, 
      lwd=4)
#=============================================
# Data.frames: multivariated with categorical
#=============================================
mdf <- data.frame(X1=rep(LETTERS[1:4], 25),
                  X2=as.factor(rep(1:10, 10)),
                  Y1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  Y2=rnorm(100, 60, 4),
                  Y3=rnorm(100, 50, 4),
                  Y4=rnorm(100, 40, 4),
                  stringsAsFactors=TRUE)
#(ft <- fdt(mdf)) # Error message due to presence of NA values
(ft <- fdt(mdf,
           na.rm=TRUE))
# Histograms
plot(ft,
     v=TRUE)
plot(ft,
     col=rainbow(8))
plot(ft,
     type='fh')
plot(ft,
     type='rfh')
plot(ft,
     type='rfph')
plot(ft,
     type='cdh')
plot(ft,
     type='cfh')
plot(ft,
     type='cfph')
# Poligons
plot(ft,
     v=TRUE,
     type='fp')
plot(ft,
     type='rfp')
plot(ft,
     type='rfpp')
plot(ft,
     type='cdp')
plot(ft,
     type='cfp')
plot(ft,
     type='cfpp') 
# Density
plot(ft,
     type='d') 
# Summary
ft
summary(ft)  # the same
print(ft)    # the same
show(ft)     # the same
summary(ft,
        format=TRUE)
summary(ft,
        format=TRUE, 
        pattern='%05.2f')  # regular expression
summary(ft,
        col=c(1:2, 4, 6), 
        format=TRUE,
        pattern='%05.2f')
print(ft,
      col=c(1:2, 4, 6))
print(ft,
      col=c(1:2, 4, 6), 
      format=TRUE,
      pattern='%05.2f')
# Using by
levels(mdf$X1)
plot(fdt(mdf,
         k=5,
         by='X1',
         na.rm=TRUE),
     col=rainbow(5))
levels(mdf$X2)
summary(fdt(iris,
            k=5),
        format=TRUE,
        patter='%04.2f')
plot(fdt(iris,
         k=5),
     col=rainbow(5))
levels(iris$Species)
summary(fdt(iris,
            k=5,
            by='Species'),
        format=TRUE,
        patter='%04.2f')
plot(fdt(iris,
         k=5,
         by='Species'),
     v=TRUE)
#=========================
# Matrices: multivariated
#=========================
summary(fdt(state.x77),
        col=c(1:2, 4, 6),
        format=TRUE)
plot(fdt(state.x77))
# Very big
summary(fdt(volcano,
            right=TRUE),
        col=c(1:2, 4, 6),
        round=3,
        format=TRUE,
        pattern='%05.1f')
plot(fdt(volcano,
         right=TRUE))
## Categorical
x <- sample(x=letters[1:5],
            size=5e2,
            rep=TRUE)
(fdt.c <- fdt_cat(x))
(fdt.c <- fdt_cat(x,
                  sort=FALSE))
#================================================
# Data.frame: multivariated with two categorical
#================================================
mdf <- data.frame(c1=sample(LETTERS[1:3], 1e2, rep=TRUE),
                  c2=as.factor(sample(1:10, 1e2, rep=TRUE)),
                  n1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  n2=rnorm(100, 60, 4),
                  n3=rnorm(100, 50, 4),
                  stringsAsFactors=TRUE)
head(mdf)
(fdt.c <- fdt_cat(mdf))
(fdt.c <- fdt_cat(mdf,
                  dec=FALSE))
(fdt.c <- fdt_cat(mdf,
                  sort=FALSE))
(fdt.c <- fdt_cat(mdf,
                  by='c1'))
#================================================
# Matrix: two categorical
#================================================
x <- matrix(sample(x=letters[1:10],
                   size=100,
                   rep=TRUE),
            nc=2,
            dimnames=list(NULL,
                          c('c1', 'c2')))
head(x)
(fdt.c <- fdt_cat(x))
Frequency distribution table for numerical data
Description
A S3 set of methods to easily perform frequency distribution table (‘fdt’) from
vector, data.frame and matrix objects.
Usage
## S3 generic
fdt(x, ...)
## S3 methods
## Default S3 method:
fdt(x,
    k,
    start,
    end,
    h,
    breaks=c('Sturges', 'Scott', 'FD'),
    right=FALSE,
    na.rm=FALSE, ...)
## S3 method for class 'data.frame'
fdt(x,
    k,
    by,
    breaks=c('Sturges', 'Scott', 'FD'),
    right=FALSE,
    na.rm=FALSE, ...)
## S3 method for class 'matrix'
fdt(x,
    k,
    breaks=c('Sturges', 'Scott', 'FD'),
    right=FALSE,
    na.rm=FALSE, ...)
Arguments
| x | a  | 
| k | number of class intervals. | 
| start | left endpoint of the first class interval. | 
| end | right endpoint of the last class interval. | 
| h | class interval width. | 
| by | categorical variable used for grouping each numeric variable,
useful only on  | 
| breaks | method used to determine the number of interval classes, c(“Sturges”, “Scott”, “FD”). | 
| right | right endpoints open (default =  | 
| na.rm | logical. Should missing values be removed? (default =  | 
| ... | potencial further arguments (required by generic). | 
Details
The simplest way to run ‘fdt’ is done by supplying only the ‘x’
object, for example: nm <- fdt(x). In this case all necessary
default values (‘breaks’ and ‘right’) (“Sturges” and FALSE
respectively) will be used.
It can be provided also:
-  ‘x’ and ‘k’ (number of class intervals); 
-  ‘x’, ‘start’ (left endpoint of the first class interval) and ‘end’ (right endpoint of the last class interval); or 
-  ‘x’, ‘start’, ‘end’ and ‘h’ (class interval width). 
These options make the ‘fdt’ very easy and flexible.
The ‘fdt’ object stores information to be used by methods summary,
print, plot, mean, median and mfv. The result of plot is a histogram.
The methods summary, print and plot provide a reasonable
set of parameters to format and plot the ‘fdt’ object in a pretty
(and publishable) way.
Value
For fdt the method fdt.default returns a list of class fdt.default with the slots:
| \samp{table} | A  | 
| \samp{breaks} | A  | 
| \samp{data} | A vector of the data ‘x’ provided. | 
The methods fdt.data.frame and fdt.matrix
return a list of class fdt.multiple.
This list has one slot for each numeric (fdt)
variable of the ‘x’ provided. Each slot, corresponding to each numeric
variable, stores the same slots of the fdt.default described above.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
hist provided by graphics and
table, cut both provided by base.
Examples
library(fdth)
#========
# Vector
#========
x <- rnorm(n=1e3,
           mean=5,
           sd=1)
str(x)
# x
(ft <- fdt(x))
# x, alternative breaks
(ft <- fdt(x,
           breaks='Scott'))
# x, k
(ft <- fdt(x,
           k=10))
# x, star, end
range(x)
(ft <- fdt(x,
           start=floor(min(x)),
           end=floor(max(x) + 1)))
# x, start, end, h
(ft <- fdt(x,
           start=floor(min(x)),
           end=floor(max(x) + 1),
           h=1))
# Effect of right
sort(x <- rep(1:3, 3))
(ft <- fdt(x,
           start=1,
           end=4,
           h=1))
(ft <- fdt(x,
           start=0,
           end=3,
           h=1,
           right=TRUE))
#================================================
# Data.frame: multivariated with two categorical
#================================================
mdf <- data.frame(c1=sample(LETTERS[1:3], 1e2, TRUE),
                  c2=as.factor(sample(1:10, 1e2, TRUE)),
                  n1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  n2=rnorm(100, 60, 4),
                  n3=rnorm(100, 50, 4),
                  stringsAsFactors=TRUE)
head(mdf)
#(ft <- fdt(mdf))  # Error message due to presence of NA values
(ft <- fdt(mdf,
           na.rm=TRUE))
str(mdf)
# By factor
(ft <- fdt(mdf,
           k=5,
           by='c1',
           na.rm=TRUE))
# choose FD criteria               
(ft <- fdt(mdf,
           breaks='FD',
           by='c1',
           na.rm=TRUE))
# k
(ft <- fdt(mdf,
           k=5,
           by='c2',
           na.rm=TRUE))
(ft <- fdt(iris,
           k=10))
(ft <- fdt(iris,
           k=5,
           by='Species'))
#=========================
# Matrices: multivariated
#=========================
(ft <-fdt(state.x77))
summary(ft,
        format=TRUE)
summary(ft,
        format=TRUE,
        pattern='%.2f')
Frequency distribution table for categorical data
Description
A S3 set of methods to easily perform categorical frequency distribution table (‘fdt_cat’) from
vector, data.frame and matrix objects.
Usage
## S3 generic
fdt_cat(x, ...)
## S3 methods
## Default S3 method:
fdt_cat(x,
        sort=TRUE,
        decreasing=TRUE, ...)
## S3 method for class 'data.frame'
fdt_cat(x,
        by,
        sort=TRUE,
        decreasing=TRUE, ...) 
## S3 method for class 'matrix'
fdt_cat(x,
        sort=TRUE,
        decreasing=TRUE, ...)
Arguments
| x | a  | 
| by | categorical variable used for grouping each categorical response,
useful only on  | 
| sort | logical. Should the  | 
| decreasing | logical. Should the sort order be increasing or decreasing?
(default =  | 
| ... | optional further arguments (required by generic). | 
Details
The simplest way to run ‘fdt_cat’ is supplying only the ‘x’
object, for example: ct <- fdt_cat(x). In this case all necessary
default values (‘sort = TRUE’ and ‘decreasing = TRUE’) will be used.
These options make the ‘fdt_cat’ very easy and flexible.
The ‘fdt_cat’ object stores information to be used by methods summary,
print, plot and mfv. The result of plot is a bar plot.
The methods summary.fdt_cat, print.fdt_cat and plot.fdt_cat provide a reasonable
set of parameters to format and plot the ‘fdt_cat’ object in a pretty
(and publishable) way.
Value
For fdt_cat the method fdt_cat.default returns a data.frame storing the ‘fdt’.
The methods fdt_cat.data.frame and fdt_cat.matrix 
return a list of class fdt_cat..multiple.
This list has one slot for each categorical
variable of the  supplied ‘x’. Each slot, corresponding to each categorical
variable, stores the same slots of the fdt_cat.default described above.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
hist provided by graphics and
table, cut both provided by base.
Examples
library(fdth)
# Categorical
x <- sample(x=letters[1:5],
            size=5e2,
            rep=TRUE)
table(x)
(ft.c <- fdt_cat(x))
(ft.c <- fdt_cat(x,
                 sort=FALSE))
#================================================
# Data.frame: multivariated with two categorical
#================================================
mdf <- data.frame(c1=sample(LETTERS[1:3], 1e2, rep=TRUE),
                  c2=as.factor(sample(1:10, 1e2, rep=TRUE)),
                  n1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  n2=rnorm(100, 60, 4),
                  n3=rnorm(100, 50, 4),
                  stringsAsFactors=TRUE)
head(mdf)
(ft.c <- fdt_cat(mdf))
(ft.c <- fdt_cat(mdf,
                 dec=FALSE))
(ft.c <- fdt_cat(mdf,
                 sort=FALSE))
(ft.c <- fdt_cat(mdf,
                 by='c1'))
#================================================
# Matrix: two categorical
#================================================
x <- matrix(sample(x=letters[1:10],
                   size=100,
                   rep=TRUE),
            nc=2,
            dimnames=list(NULL,
                          c('c1', 'c2')))
head(x)
(ft.c <- fdt_cat(x))
Internal fdth functions
Description
Internal fdth functions.
Details
These are not to be called by the user and are undocumented.
Frequency distribution table for continuous and categorical variables
Description
Makes a full fdt from a minimal set of information. 
Useful to reproduce (when the real data vector is not known) a previous fdt.
Usage
make.fdt(f,
         start,
         end,
         right=FALSE)
make.fdt_cat(f,
             categories=NULL,
             sort=TRUE,
             decreasing=TRUE)
Arguments
| f | a numeric  | 
| start | the left value of the interval of the first class. | 
| end | the last value of the interval of the last class. | 
| right | intervals right open (default =  | 
| categories | the levels of the categorical variable. | 
| sort | the levels of the categorical variable will be ordered by frequency. The default is  | 
| decreasing | if sort argument is  | 
Details
 Given the starting and ending values of the continuous variable table 
or the levels of the categorical variable plus the number of intervals and the 
absolute frequency values the functions make.fdt and make.fdt_cat
reconstruct whole fdt or fdt_cat table.   
Value
The function make.fdt returns a list with the slots:
| table | a  | 
| breaks | a  | 
The function make.fdt_cat returns a list whith the slots:
| Category | the levels of the categorical variable. | 
| f | absolute frequency,  | 
| rf | relative frequency,  | 
| rf(%) | relative frequency in percentages,  | 
| cf | cumulative frequency;  | 
| cf(%) | cumulative frequency in percentages,  | 
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
table and cut provided by base package.
Examples
# Numeric
# Making one reference fdt
set.seed(33)
x <- rnorm(1e3, 
           20, 
           2)
(ft.r <- fdt(x))
# Making a brand new
(ft.n  <- make.fdt(f=ft.r$table$f,
                   start=13.711,
                   end=27.229))   # Huumm ..., good, but ... Can it be better? 
summary(ft.n,
        format=TRUE,
        pattern='%.3f')           # Is it nice now?
# Categorical
x <- sample(letters[1:5], 
            1e3, 
            rep=TRUE)
# Making one reference fdt
(ft.r <- fdt_cat(x))
# Making a brand new
(ft.n <- make.fdt_cat(f=ft.r$f,
                      categ=ft.r$Category))
Mean of frequency distribution table (numerical variable)
Description
S3 method for the arithmetic mean of a fdt. 
 
Useful to estimate the arithmetic mean (when the real data vector is not known) from a previous fdt.
Usage
## S3 method: numerical 
## S3 method for class 'fdt'
mean(x, ...)
Arguments
| x | a  | 
| ... | required by generic. | 
Details
mean.fdt calculates the mean value based on a known formula using 
the midpoint of each interval class. mean.fdt.multiple calls mean.fdt
for each variable, that is, each column of the data.frame.
Value
mean.fdt returns a numeric vector containing the mean value of the fdt.
mean.fdt.multiple returns a list, where each element is a numeric vector 
containing the mean value of the fdt for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
median.fdt, mfv.
Examples
mdf <- data.frame(x=rnorm(1e3, 
                          20, 
                          2),
                  y=rnorm(1e3, 
                          30,
                          3),
                  z=rnorm(1e3,
                          40,
                          4))
head(mdf)
apply(mdf,
      2,
      mean)
mean(fdt(mdf))      
Median of frequency distribution table (numerical variable)
Description
S3 method for the median of a fdt. 
 
Useful to estimate the median (when the real data vector is not known) from a previous fdt.
Usage
## S3 method: numerical 
## S3 method for class 'fdt'
median(x, ...)
Arguments
| x | a  | 
| ... | required by generic. | 
Details
median.fdt calculates the value of the median based on a known formula. 
median.fdt.multiple calls mean.fdtfor each variable, that is, each column of the data.frame.
Value
mean.fdt returns a numeric vector containing the value of the median of the fdt.
median.fdt.multiple returns a list, where each element is a numeric vector 
containing the value of the median of the fdt for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
mean.fdt, mfv.
Examples
mdf <- data.frame(x=rnorm(1e3, 
                          20, 
                          2),
                  y=rnorm(1e3, 
                          30,
                          3),
                  z=rnorm(1e3,
                          40,
                          4))
head(mdf)
apply(mdf,
      2,
      median)
median(fdt(mdf))      
Most frequent value (statistical mode) of frequency distribution table (numerical and categorical variable)
Description
S3 methods for the most frequent value (statistical mode) of a fdt. 
 
Useful to estimate the most frequent value or statistical mode. May also be used, by using a previous fdt, when the original data vector is not known.
Usage
## S3 generic
mfv(x, ...)
## S3 methods: numerical and categorical 
## Default S3 method:
mfv(x, ...)
## S3 method for class 'fdt'
mfv(x, ...)
## S3 method for class 'fdt.multiple'
mfv(x, ...)
## S3 method for class 'fdt_cat'
mfv(x, ...)
## S3 method for class 'fdt_cat.multiple'
mfv(x, ...)
Arguments
| x | a  | 
| ... | required to be generic. | 
Details
mfv.fdt and mfv.fdt_cat calculates the most frequent value (mfv) based on a known formula. 
mfv.fdt.multiple and mfv.fdt_cat.multiplecall respectively mfv.fdt 
or mfv.fdt_catfor each variable, that is, each column of the data.frame.
Value
mfv.fdt returns a numeric vector containing the mfv value of the fdt.
mean.fdt.multiple returns a list, where each element is a numeric vector 
containing the mean value of the fdt for each variable.
mfv.fdt_cat returns a character vector containing the mfv value of the fdt_cat.
mean.fdt_cat.multiple returns a list, where each element is a character vector 
containing the mfv value of the fdt_cat for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
mean.fdt, median.fdt.
Examples
# Numerical
mdf <- data.frame(x=rnorm(1e2, 
                          20, 
                          2),
                  y=rnorm(1e2, 
                          30,
                          3),
                  z=rnorm(1e2,
                          40,
                          4))
head(mdf)
mfv(mdf$x)   # From vector x
mfv(mdf$y)   # From vector y
mfv(mdf$z)   # From vector z
(ft <- fdt(mdf))
mfv(ft)      # From agruped data in a fdt object
## Categorical
mdf <- data.frame(c1=sample(letters[1:5],
                            1e3,
                            rep=TRUE),
                  c2=sample(letters[6:10],
                            1e3,
                            rep=TRUE),
                  c3=sample(letters[11:21],
                            1e3,
                            rep=TRUE),
                  stringsAsFactors=TRUE)
head(mdf)
mfv(mdf$c1)   # From vector c1
mfv(mdf$c2)   # From vector c2
mfv(mdf$c3)   # From vector c3
(ft <- fdt_cat(mdf))
mfv(ft)      # From agruped data in a fdt object
Plot fdt.default and fdt.multiple objects
Description
S3 methods for fdt.default and fdt.multiple objects.
It is possible to plot histograms and polygons (absolute, relative
and cumulative).
Usage
## S3 methods  
## S3 method for class 'fdt.default'
plot(x,
     type=c('fh', 'fp', 
            'rfh', 'rfp', 
            'rfph', 'rfpp', 
            'd',
            'cdh', 'cdp', 
            'cfh', 'cfp', 
            'cfph', 'cfpp'),
     v=FALSE,
     v.round=2,
     v.pos=3,
     xlab="Class limits",
     xlas=0,
     ylab=NULL,
     col="gray",
     xlim=NULL,
     ylim=NULL,
     main=NULL,
     x.round=2, ...)
## S3 method for class 'fdt.multiple'
plot(x,
     type=c('fh', 'fp', 
            'rfh', 'rfp', 
            'rfph', 'rfpp', 
            'd',
            'cdh', 'cdp', 
            'cfh', 'cfp', 
            'cfph', 'cfpp'),
     v=FALSE,
     v.round=2,
     v.pos=3,
     xlab="Class limits",
     xlas=0,
     ylab=NULL, 
     col="gray",
     xlim=NULL,
     ylim=NULL,
     main=NULL,
     main.vars=TRUE,
     x.round=2, 
     grouped=FALSE,
     args.legend=NULL, ...)
## S3 method for class 'fdt_cat.default'
plot(x,
     type=c('fb', 'fp', 'fd',
            'rfb', 'rfp', 'rfd',
            'rfpb', 'rfpp', 'rfpd',
            'cfb', 'cfp', 'cfd',
            'cfpb', 'cfpp', 'cfpd',
            'pa'),
     v=FALSE,
     v.round=2,
     v.pos=3,
     xlab=NULL,
     xlas=0,
     ylab=NULL,
     y2lab=NULL,
     y2cfp=seq(0, 100, 25),
     col=gray(.4),
     xlim=NULL,
     ylim=NULL,
     main=NULL,
     box=FALSE, ...)
## S3 method for class 'fdt_cat.multiple'
plot(x,
     type=c('fb', 'fp', 'fd',
            'rfb', 'rfp', 'rfd',
            'rfpb', 'rfpp', 'rfpd',
            'cfb', 'cfp', 'cfd',
            'cfpb', 'cfpp', 'cfpd',
            'pa'),
     v=FALSE,
     v.round=2,
     v.pos=3,
     xlab=NULL,
     xlas=0,
     ylab=NULL, 
     y2lab=NULL,
     y2cfp=seq(0, 100, 25),
     col=gray(.4),
     xlim=NULL,
     ylim=NULL,
     main=NULL,
     main.vars=TRUE,
     box=FALSE, ...)
Arguments
| x | A ‘fdt’ object. | 
| type | the type of the plot:  ‘rfb:’ relative frequency barplot,  ‘rfpb:’ relative frequency (%) barplot,  ‘d:’ density,  ‘cfb:’ cumulative frequency barplot,  ‘cdpb:’ cumulative frequency (%) barplot,  ‘pa:’ pareto chart. | 
| v | logical flag: should the values be added to the plot? | 
| v.round | if  | 
| v.pos | if  | 
| xlab | a label for the ‘x’ axis. | 
| xlas | an integer which controls the orientation of the ‘x’ axis labels:  | 
| ylab | a label for the ‘y’ axis. | 
| y2lab | a label for the ‘y2’ axis. | 
| y2cfp | a cumulative percent frequency for the ‘y2’ axis. The default is  | 
| col | a  | 
| xlim | the ‘x’ limits of the plot. | 
| ylim | the ‘y’ limits of the plot. | 
| main | title of the plot(s). This option has priority over ‘main.vars’, i.e, if any value is informed,
the variable names will not be used as title of the plot(s). For  | 
| main.vars | logical flag: should the variables names be added as title of each plot (default  | 
| x.round | a numeric value to round the ‘x’ ticks:
‘0:’ parallel to the axes,  | 
| box | if  | 
| grouped | if  | 
| args.legend | list of additional arguments to be passed to  | 
| ... | optional plotting parameters. | 
Details
The result is a single histogram or polygon (absolute, relative or cumulative)
for fdt.default or a set of histograms or polygon (absolute, relative or
cumulative) for fdt.multiple objects.
Both ‘default and multiple’ try to compute the maximum number of histograms
that will fit on one page, then it draws a matrix of histograms. More than one
graphical device may be opened to show all histograms.
The result is a single barplot, polygon, dotchar (absolute, relative or cumulative)
and Pareto chart for fdt_cat.default or a set of the same graphs for 
fdt_cat.multiple objects.
Both ‘default and multiple’ try to compute the maximum number of histograms
that will fit on one page, then it draws a matrix of graphs lited above. More than one
graphical device may be opened to show all graphs.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
Examples
library(fdth)
#================================
# Vectors: univariated numerical
#================================
x <- rnorm(n=1e3,
           mean=5,
           sd=1)
(ft <- fdt(x))
# Histograms
plot(ft)  # Absolute frequency histogram
plot(ft,
     main='My title')
plot(ft,
     x.round=3,
     col='darkgreen')
plot(ft,
     xlas=2)
plot(ft,
     x.round=3,
     xlas=2,
     xlab=NULL)
plot(ft,
     v=TRUE, 
     cex=.8,
     x.round=3,
     xlas=2,
     xlab=NULL,
     col=rainbow(11))
plot(ft,
     type='fh')    # Absolute frequency histogram
plot(ft,
     type='rfh')   # Relative frequency histogram
plot(ft,
     type='rfph')  # Relative frequency (%) histogram
plot(ft,
     type='cdh')   # Cumulative density histogram
plot(ft,
     type='cfh')   # Cumulative frequency histogram
plot(ft,
     type='cfph')  # Cumulative frequency (%) histogram
# Poligons
plot(ft,
     type='fp')    # Absolute frequency polygon
plot(ft,
     type='rfp')   # Relative frequency polygon
plot(ft,
     type='rfpp')  # Relative frequency (%) polygon
plot(ft,
     type='cdp')   # Cumulative density polygon
plot(ft,
     type='cfp')   # Cumulative frequency polygon
plot(ft,
     type='cfpp')  # Cumulative frequency (%) polygon
# Density
plot(ft,
     type='d')     # Density
# Theoretical curve and fdt
x <- rnorm(1e5,
           mean=5,
           sd=1)
plot(fdt(x,
         k=100), 
     type='d', 
     col=heat.colors(100))
curve(dnorm(x,
            mean=5,
            sd=1),
      col='darkgreen',
      add=TRUE,
      lwd=2)
#==================================
# Vectors: univariated categorical
#==================================
x <- sample(letters[1:5],
            1e3,
            rep=TRUE)
(ft.c <- fdt_cat(x))
# Barplot: the default
plot(ft.c)
# Barplot
plot(ft.c,
     type='fb')
# Polygon
plot(ft.c,
     type='fp')
# Dotchart
plot(ft.c,
     type='fd')
# Pareto chart
plot(ft.c,
     type='pa')
#=============================================
# Data.frames: multivariated with categorical
#=============================================
mdf <- data.frame(X1=rep(LETTERS[1:4], 25),
                  X2=as.factor(rep(1:10, 10)),
                  Y1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  Y2=rnorm(100, 60, 4),
                  Y3=rnorm(100, 50, 4),
                  Y4=rnorm(100, 40, 4),
                  stringsAsFactors=TRUE)
# Histograms
(ft <- fdt(mdf,
           na.rm=TRUE))
plot(ft,
     v=TRUE,
     cex=.8)
plot(ft,
     col='darkgreen',
     ylim=c(0, 40))
plot(ft,
     col=rainbow(8), 
     ylim=c(0, 40),
     main=LETTERS[1:4])
plot(ft,
     type='fh')
plot(ft,
     type='rfh')
plot(ft,
     type='rfph')
plot(ft,
     type='cdh')
plot(ft,
     type='cfh')
plot(ft,
     type='cfph')
# Poligons
plot(ft,
     v=TRUE,
     type='fp')
plot(ft,
     type='rfp')
plot(ft,
     type='rfpp')
plot(ft,
     type='cdp')
plot(ft,
     type='cfp')
plot(ft,
     type='cfpp') 
# Density
plot(ft,
     type='d')     
levels(mdf$X1)
plot(fdt(mdf,
         k=5,
         by='X1',
         na.rm=TRUE),
     ylim=c(0, 12))
levels(mdf$X2)
plot(fdt(mdf,
         breaks='FD', 
         by='X2',
         na.rm=TRUE))
plot(fdt(mdf,
         k=5,
         by='X2',
         na.rm=TRUE))  # It is difficult to compare
plot(fdt(mdf,
         k=5,
         by='X2',
         na.rm=TRUE),
     ylim=c(0, 8))  # Easy
plot(fdt(iris,
         k=5))
plot(fdt(iris,
         k=5),
     col=rainbow(5))
plot(fdt(iris, 
         k=5,
         by='Species'),
     v=TRUE)
ft <- fdt(iris,
          k=10)
plot(ft)
plot(ft,
     type='d')
# Categorical data 
(ft.c <- fdt_cat(mdf))
plot(ft.c)
plot(ft.c,
     type='fd',
     pch=19)
#=========================
# Matrices: multivariated
#=========================
plot(fdt(state.x77))
plot(fdt(volcano))
Print methods for fdt objects
Description
S3 methods to return a data.frame (the frequency distribution table - fdt) 
for fdt.default and fdt.multiple objects; data.frame (the frequency
distribution table - fdt_cat) for fdt_cat.default and fdt_cat.multiple
objects.
Usage
## S3 methods
## S3 method for class 'fdt.default'
print(x,
      columns=1:6,
      round=2,
      format.classes=FALSE,
      pattern='%09.3e',
      row.names=FALSE,
      right=TRUE, ...)
## S3 method for class 'fdt.multiple'
print(x,
      columns=1:6,
      round=2,
      format.classes=FALSE,
      pattern='%09.3e',
      row.names=FALSE,
      right=TRUE, ...)
## S3 method for class 'fdt_cat.default'
print(x,
      columns=1:6,
      round=2,
      row.names=FALSE,
      right=TRUE, ...)
## S3 method for class 'fdt_cat.multiple'
print(x,
      columns=1:6,
      round=2,
      row.names=FALSE,
      right=TRUE, ...)
Arguments
| x | a ‘fdt’ object. | 
| columns | a  | 
| round | rounds ‘fdt’ columns to the specified number of decimal places (default 2). | 
| format.classes | logical, if  | 
| pattern | same as  | 
| row.names | logical (or character vector), indicating whether (or what)
row names should be printed. The default is  | 
| right | logical, indicating whether or not strings should be right-aligned. The default is right-alignment. | 
| ... | potential further arguments (require by generic). | 
Details
For print.fdt, it is possible to select what columns of the table 
(a data.frame) will be shown, as well as the pattern of the first column,
for print.fdt_cat it is only possible to select what columns of the table 
(a data.frame) will be shown. The columns are:
-  ‘Class limits’ 
-  ‘f’ - absolute frequency 
-  ‘rf’ - relative frequency 
-  ‘rf(%)’ - relative frequency, % 
-  ‘cf’ - cumulative frequency 
-  ‘cf(%)’ - cumulative frequency, % 
The available parameters offer an easy and powerful way to format the ‘fdt’ for publications and other purposes.
Value
A single data.frame for fdt.default and fdt.default or multiple
data.frames for fdt.multiple and fdt_cat.multiple.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
Examples
library (fdth)
#======================
# Vectors: univariated
#======================
set.seed(1)
x <- rnorm(n=1e3,
           mean=5,
           sd=1)
ft <- fdt(x)
str(ft)
ft
print(ft) # the same
print(ft,
      format=TRUE)      # It can not be what you want to publications!
print(ft,
      format=TRUE,
      pattern='%.2f')   # Huumm ..., good, but ... Can it be better?
print(ft,
      col=c(1:2, 4, 6),
      format=TRUE, 
      pattern='%.2f')   # Yes, it can!
range(x)                 # To know x
print(fdt(x,
          start=1,
          end=9,
          h=1),
      col=c(1:2, 4, 6),
      format=TRUE,
      pattern='%d')     # Is it nice now?
ft[['table']]                        # Stores the feq. dist. table (fdt)
ft[['breaks']]                       # Stores the breaks of fdt
ft[['breaks']]['start']              # Stores the left value of the first class
ft[['breaks']]['end']                # Stores the right value of the last class
ft[['breaks']]['h']                  # Stores the class interval
as.logical(ft[['breaks']]['right'])  # Stores the right option
#=============================================
# Data.frames: multivariated with categorical
#=============================================
mdf <- data.frame(X1=rep(LETTERS[1:4], 25),
                  X2=as.factor(rep(1:10, 10)),
                  Y1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  Y2=rnorm(100, 60, 4),
                  Y3=rnorm(100, 50, 4),
                  Y4=rnorm(100, 40, 4),
                  stringsAsFactors=TRUE)
                  
(ft <- fdt_cat(mdf))
                  
print(ft)
(ft <- fdt(mdf,
         na.rm=TRUE))
print(ft)
str(ft)
print(ft, # the s
      format=TRUE)
print(ft,
      format=TRUE,
      pattern='%05.2f')  # regular expression
print(ft,
      col=c(1:2, 4, 6), 
      format=TRUE, 
      pattern='%05.2f')
print(ft,
      col=c(1:2, 4, 6))
print(ft,
      col=c(1:2, 4, 6),
      format=TRUE,
      pattern='%05.2f')
levels(mdf$X1)
print(fdt(mdf,
          k=5,
          by='X1',
         na.rm=TRUE))
levels(mdf$X2)
print(fdt(mdf,
          breaks='FD',
          by='X2',
          na.rm=TRUE),
      round=3)
print(fdt(mdf,
          k=5,
          by='X2',
          na.rm=TRUE),
      format=TRUE,
      round=3)
print(fdt(iris,
          k=5),
      format=TRUE,
      patter='%04.2f')
levels(iris$Species)
print(fdt(iris,
          k=5,
          by='Species'),
      format=TRUE, 
      patter='%04.2f')
#=========================
# Matrices: multivariated
#=========================
print(fdt(state.x77), 
      col=c(1:2, 4, 6),
      format=TRUE)
print(fdt(volcano,
          right=TRUE),
      col=c(1:2, 4, 6),
      round=3,
      format=TRUE,
      pattern='%05.1f')
Quantile of frequency distribution table (numerical variable)
Description
S3 methods for the quantile of a fdt. 
 
Useful to estimate the quantile (when the real data vector is not known) from a previous fdt.
Usage
## S3 methods: numerical
## S3 method for class 'fdt'
quantile(x,
         ...,
         i=1,
         probs=seq(0, 1, 0.25))
## S3 method for class 'fdt.multiple'
quantile(x, ...) 
Arguments
| x | a  | 
| i | a vector of length up to the length of probs | 
| probs | vector of probabilities defining the quantiles | 
| ... | potencial further arguments (required by generic). | 
Details
quantile.fdt calculates the quantiles based on a known formula for 
class intervals. quantile.fdt.multiple calls quantile.fdt
for each variable, that is, each column of the data.frame.
Value
quantile.fdt returns a numeric vector containing the value(s) of the 
quantile(s) from fdt.
quantile.fdt.multiple returns a list, where each element is a numeric vector 
containing the quantile(s) of the fdt for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
median.fdt, var.fdt.
Examples
mdf <- data.frame(x=rnorm(1e2, 
                          20, 
                          2),
                  y=rnorm(1e2, 
                          30,
                          3),
                  z=rnorm(1e2,
                          40,
                          4))
head(mdf)
# From data.frame
apply(mdf,
      2,
      quantile)[-c(1,4), ]
# From fdt
quantile(fdt(mdf))               # Notice that the i default is 1 (the first quartile)
## A small (but didactic) joke  
quantile(fdt(mdf),
         i=2,
         probs=seq(0, 
                   1, 
                   0.25))        # The quartile 2
quantile(fdt(mdf),
         i=5,
         probs=seq(0, 
                   1, 
                   0.10))        # The decile 5 
quantile(fdt(mdf),
         i=50,
         probs=seq(0, 
                   1, 
                   0.01))        # The percentile 50
quantile(fdt(mdf),
         i=500,
         probs=seq(0, 
                   1, 
                   0.001))       # The permile 500
median(fdt(mdf))                 # The median (all the results are the same) ;)
# More than one quantile
ql <- numeric()
for(i in 1:3)
  ql[i] <- quantile(fdt(mdf$x),
                    i=i,
                    probs=seq(0,
                              1,
                              0.25))  # The tree quartiles
names(ql) <- paste0(c(25,
                      50,
                      75),
                    '%')
round(ql,
      2)
Standard deviation of frequency distribution table (numerical variable)
Description
S3 methods for the standard deviation of a fdt. 
 
Useful to estimate the standard deviation (when the real data vector is not known) from a previous fdt.
Usage
## S3 generic
sd(x, ...)
## S3 methods: numerical
## Default S3 method:
sd(x, ...)
## S3 method for class 'fdt'
sd(x, ...)
## S3 method for class 'fdt.multiple'
sd(x, ...)
Arguments
| x | a  | 
| ... | required to be generic. | 
Details
sd.fdt calculates the value of the variance based on a known formula. 
sd.fdt.multiple calls sd.fdtfor each variable, that is, each column of the data.frame.
Value
sd.fdt returns a numeric vector containing the value of the median of the fdt.
median.fdt.multiple returns a list, where each element is a numeric vector 
containing the value of the variance of the fdt for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
var.fdt, mean.fdt.
Examples
mdf <- data.frame(x=rnorm(1e3, 
                          20, 
                          2),
                  y=rnorm(1e3, 
                          30,
                          3),
                  z=rnorm(1e3,
                          40,
                          4))
head(mdf)
apply(mdf,
      2,
      sd)
sd(fdt(mdf))      
Summary methods for fdt objects
Description
S3 methods to return a data.frame (the frequency
distribution table - ‘fdt’) for fdt.default, fdt.multiple,
fdt_cat.default and fdt_cat.multiple objects.
Usage
## S3 methods
## S3 method for class 'fdt.default'
summary(object,
        columns=1:6,
        round=2,
        format.classes=FALSE,
        pattern="%09.3e",
        row.names=FALSE,
        right=TRUE, ...)
## S3 method for class 'fdt.multiple'
summary(object,
        columns=1:6,
        round=2,
        format.classes=FALSE,
        pattern="%09.3e",
        row.names=FALSE,
        right=TRUE, ...)
## S3 method for class 'fdt_cat.default'
summary(object,
        columns=1:6,
        round=2,
        row.names=FALSE,
        right=TRUE, ...)
## S3 method for class 'fdt_cat.multiple'
summary(object,
        columns=1:6,
        round=2,
        row.names=FALSE,
        right=TRUE, ...)
Arguments
| object | a  | 
| columns | a  | 
| round | rounds ‘fdt’ columns to the specified number of decimal places (default 2). | 
| format.classes | logical, if  | 
| pattern | same as  | 
| row.names | logical (or character vector), indicating whether (or what)
row names should be printed. The default is  | 
| right | logical, indicating whether or not strings should be right-aligned. The default is right-alignment. | 
| ... | optional further arguments (require by generic). | 
Details
It is possible to select what columns of the table (a data.frame)
will be shown, as well as the pattern of the first column. The columns are:
-  ‘Class limits’ 
-  ‘f’ - absolute frequency 
-  ‘rf’ - relative frequency 
-  ‘rf(%)’ - relative frequency, % 
-  ‘cf’ - cumulative frequency 
-  ‘cf(%)’ - cumulative frequency, % 
The available parameters offer an easy and powerful way to format the ‘fdt’ for publications and other purposes.
Value
A single data.frame for fdt.default or multiple
data.frames for fdt.multiple.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
Examples
library (fdth)
#======================
# Vectors: univariated
#======================
set.seed(1)
x <- rnorm(n=1e3,
           mean=5,
           sd=1)
ft <- fdt(x)
str(ft)
ft
summary(ft)  # the same
summary(ft,
        format=TRUE)      # It can not be what you want to publications!
summary(ft,
        format=TRUE,
        pattern='%.2f')   # Huumm ..., good, but ... Can it be better?
summary(ft,
        col=c(1:2, 4, 6),
        format=TRUE,
        pattern='%.2f')   # Yes, it can!
range(x)                  # To know x
summary(fdt(x,
            start=1,
            end=9,
            h=1),
        col=c(1:2, 4, 6),
        format=TRUE,
        pattern='%d')     # Is it nice now?
ft[['table']]                        # Stores the feq. dist. table (fdt)
ft[['breaks']]                       # Stores the breaks of fdt
ft[['breaks']]['start']              # Stores the left value of the first class
ft[['breaks']]['end']                # Stores the right value of the last class
ft[['breaks']]['h']                  # Stores the class interval
as.logical(ft[['breaks']]['right'])  # Stores the right option
#=============================================
# Data.frames: multivariated with categorical
#=============================================
mdf <- data.frame(X1=rep(LETTERS[1:4], 25),
                  X2=as.factor(rep(1:10, 10)),
                  Y1=c(NA, NA, rnorm(96, 10, 1), NA, NA),
                  Y2=rnorm(100, 60, 4),
                  Y3=rnorm(100, 50, 4),
                  Y4=rnorm(100, 40, 4),
                  stringsAsFactors=TRUE)
                  
ft_c <- fdt_cat(mdf)
summary(ft_c)
ft <- fdt(mdf,
          na.rm=TRUE)
str(ft)
summary(ft)  # the same
summary(ft,
        format=TRUE)
summary(ft,
        format=TRUE,
        pattern='%05.2f')  # regular expression
summary(ft,
        col=c(1:2, 4, 6),
        format=TRUE,
        pattern='%05.2f')
print(ft,
      col=c(1:2, 4, 6))
print(ft,
      col=c(1:2, 4, 6),
      format=TRUE,
      pattern='%05.2f')
levels(mdf$X1)
summary(fdt(mdf,
            k=5,
            by='X1',
            na.rm=TRUE))
levels(mdf$X2)
summary(fdt(mdf, 
            breaks='FD',
            by='X2',
            na.rm=TRUE),
        round=3)
summary(fdt(mdf,
            k=5,
            by='X2',
            na.rm=TRUE),
        format=TRUE,
        round=3)
summary(fdt(iris,
            k=5),
        format=TRUE,
        patter='%04.2f')
levels(iris$Species)
summary(fdt(iris, 
            k=5, 
            by='Species'), 
        format=TRUE, 
        patter='%04.2f')
#=========================
# Matrices: multivariated
#=========================
summary(fdt(state.x77),
        col=c(1:2, 4, 6),
        format=TRUE)
summary(fdt(volcano,
            right=TRUE),
        col=c(1:2, 4, 6),
        round=3,
        format=TRUE,
        pattern='%05.1f')
Variance of frequency distribution table (numerical variable)
Description
S3 methods for the variance of a fdt. 
 
Useful to estimate the variance (when the real data vector is not known) from a previous fdt.
Usage
## S3 generic
var(x, ...)
## S3 methods: numerical
## Default S3 method:
var(x, ...)
## S3 method for class 'fdt'
var(x, ...)
## S3 method for class 'fdt.multiple'
var(x, ...)
Arguments
| x | a  | 
| ... | required to be generic. | 
Details
var.fdt calculates the value of the variance based on a known formula. 
var.fdt.multiple calls var.fdtfor each variable, that is, each column of the data.frame.
Value
var.fdt returns a numeric vector containing the value of the median of the fdt.
median.fdt.multiple returns a list, where each element is a numeric vector 
containing the value of the variance of the fdt for each variable.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
sd.fdt, mean.fdt.
Examples
mdf <- data.frame(x=rnorm(1e2, 
                          20, 
                          2),
                  y=rnorm(1e2, 
                          30,
                          3),
                  z=rnorm(1e2,
                          40,
                          4))
head(mdf)
apply(mdf,
      2,
      var)
var(fdt(mdf))      
LaTeX table of the frequency distribution table
Description
This function returns a LaTeX table of the fdt, fdt.multiple and fdt_cat.multiple objects of the xtable class.
Usage
## S3 method for class 'fdt'
xtable(x,
       caption = NULL,
       label = NULL,
       align = NULL,
       digits = NULL,
       display = NULL,
       auto = FALSE,
       ...)
## S3 method for class 'fdt.multiple'
xtable(x,
       caption = NULL,
       label = NULL,
       align = NULL,
       digits = NULL,
       display = NULL,
       ...)
## S3 method for class 'fdt_cat.multiple'
xtable(x,
       caption = NULL,
       label = NULL,
       align = NULL,
       digits = NULL,
       display = NULL,
       ...)
Arguments
| x | A  | 
| caption | Character vector of length 1 or 2 containing the table's caption or title. See  | 
| label | Character vector of length 1 containing the LaTeX label or HTML anchor. See  | 
| align | Character vector of length equal to the number of columns of the resulting table, indicating the alignment of the corresponding columns. See  | 
| digits | Numeric vector of length equal to one (in which case it will be replicated as necessary) or to the number of columns of the resulting table or matrix of the same size as the resulting table, indicating the number of digits to display in the corresponding columns. See  | 
| display | Character vector of length equal to the number of columns of the resulting table, indicating the format for the corresponding columns. See  | 
| auto | Logical, indicating whether to apply automatic format when no value is passed to  | 
| ... | Additional arguments. | 
Details
The functions latex.fdt was deprecated. We understand over the years that creating a method for the generic xtable function would be inevitable, given the advancement of the xtable package and its support by the academic community.
Then, the fdt, fdt.multiple and fdt_cat.multiple methods were created for the generic xtable function. For object of the fdt_cat class, no methods were created, as they inherit the data.frame class, and therefore, the xtable functions can be used directly for such objects.
Objects of the fdt.multiple and fdt_cat.multiple class, when using the xtable function, will have the xtableList class. Although it may seem confusing, the xtableList function in the xtable package has no generic function and therefore it was not possible to create a method of type xtableList.fdt.multiple. Therefore, a method of the xtable.fdt.multiple class was created, but the function xtableList is being used internally.
More examples than those provided in the manual can be seen in the vignette.
It is possible to select what columns of the table (a data.frame) will be shown, as well as the pattern of the first column. The columns are:
-  ‘Class limits’ 
-  ‘f’ - Absolute frequency 
-  ‘rf’ - Relative frequency 
-  ‘rf(%)’ - Relative frequency, % 
-  ‘cf’ - Cumulative frequency 
-  ‘cf(%)’ - Cumulative frequency, % 
Value
An object of the class xtable.fdt, xtable.fdt.multiple and xtable.fdt_cat.multiple.
Author(s)
Faria, J. C. 
Allaman, I. B 
Jelihovschi, E. G.
See Also
Examples
library(fdth)
library(xtable)
# +++++ Quantitative data
## Example 1: The simplest possible
t1 <- fdt(rnorm(n=1e3,
                mean=10,
                sd=2))
t1x <- xtable(t1)
t1x
## Example 2
print(t1x,
      include.rownames=FALSE)
## Example 3
newclass <- gsub("[\\[\\)]",
                 "",
                 t1x[,1],
                 perl=TRUE)
t3x <- t1x
t3x[,1] <- newclass
print(t3x,
      include.rownames=FALSE,
      sanitize.text.function = function(x)gsub(",",
                                               "\\dashv",
                                               x,
                                               perl = TRUE))
## Not run: 
## Example 4
clim <- t1$table[1]
clim1 <- sapply(clim,
                as.character)
right <- t1$breaks[4]
pattern <- "
clim2 <- make.fdt.format.classes(clim1,
                                 right,
                                 pattern)
clim3 <- sapply(clim2,
                function(x) paste0("$",
                                   x,
                                   "$"))
t4x <- t1x
t4x[,1] <- clim3
print(t4x,
      include.rownames=FALSE)
## End(Not run)
## Example 5
t5 <- fdt(iris,
          by="Species")
attr(t5, "subheadings") <- paste0("Variable = ",
                      names(t5))
xtable(t5)
# +++++ Qualitative data
## Example 6
t6 <- fdt_cat(sample(LETTERS[1:3], 
                     replace=TRUE,
                     size=30))
t6x <- xtable(t6)
t6x
t61 <- fdt_cat(data.frame(c1=sample(LETTERS[1:3],
                                    replace=TRUE,
                                    size=10),
                          c2=sample(letters[4:5],
                                    replace=TRUE,
                                    size=10),
                          stringsAsFactors=TRUE))
attr(t61, "subheadings") <- paste0("Variable = ",
                                   names(t61))
t61x <- xtable(t61)
t61x