bdImportData_hdf5

HDF5_IO_MANAGEMENT

1 Usage

bdImportData_hdf5(...)

2 Arguments

Parameter Description
inFile Character string specifying either a local file path or URL containing the data to import
destFile Character string specifying the file name and path where the HDF5 file will be stored
destGroup Character string specifying the group name within the HDF5 file where the dataset will be stored
destDataset Character string specifying the name for the dataset within the HDF5 file
header Logical or character vector. If TRUE, the first row contains column names. If a character vector, use these as column names. Default is TRUE.
rownames Logical or character vector. If TRUE, first column contains row names. If a character vector, use these as row names. Default is FALSE.
overwrite Logical indicating if existing datasets should be overwritten. Default is FALSE.
overwriteFile Logical indicating if the entire HDF5 file should be overwritten if it exists. CAUTION: This will delete all existing data. Default is FALSE.
sep Character string specifying the field separator in the input file. Default is “\t” (tab).
paral Logical indicating whether to use parallel computation. Default is TRUE.
threads Integer specifying the number of threads to use for parallel computation. Only used if paral=TRUE. If NULL, uses maximum available threads.

3 Value

No return value. The function writes the data directly to the specified HDF5 file.

4 Examples

Code
# Import from local file
bdImportData_hdf5(
  inFile = "data.txt",
  destFile = "output.h5",
  destGroup = "mydata",
  destDataset = "matrix1",
  header = TRUE,
  sep = "\t"
)

# Import from URL
bdImportData_hdf5(
  inFile = "https://example.com/data.csv",
  destFile = "output.h5",
  destGroup = "downloaded",
  destDataset = "remote_data",
  sep = ","
)