Skip to content

Latest commit

 

History

History
48 lines (35 loc) · 1.28 KB

README.md

File metadata and controls

48 lines (35 loc) · 1.28 KB

data.tools: Tools for data manipulation

Package that includes functions for data manipulation. It is suggested to use the package with data.table.

Installation

You can install the package using devtools:

devtools::install_github("santiagohermo/data.tools")

Usage

library(data.tools)

# Simulate panel data
dt <- data.table::data.table(unit = c("A", "A", "B", "B", "C", "C", "D", "D", "E", "E"), 
                             time = rep(c(1, 2), 5))
dt[, y := rnorm(.N)]

# Create equal-sized groups
dt[, terciles_y        := cut_in_n(y, n=3)]
dt[, terciles_y_within := cut_in_n(y, n=3), by=time]

# Save the data set to a csv file with a log file
save_data(dt, key = c("unit", "time"), 
          outfile = "data.csv")

# Save the data set to a feather without a log file
save_data(dt, key = c("unit", "time"), 
          outfile = "data.feather", 
          logfile = FALSE)

Available functions

  • cut_in_n: Bin a numeric vector into n equal-sized groups.
  • save_data: Save data set to a file with a log file, several formats are available.
  • weighted_sd and weighted_var: Compute weighted standard deviation and variance.