Dataset

scalation.theory.Dataset
case class Dataset(name: String, fileName: String, ncols: Int, xcols: Array[Int], ycol: Int)

The Dataset class supports loading data files (e.g., CSV files) and pre-processing them to create predictor/input matrices and response/output vectors. FIX -- extend to models that allow multiple responses/outputs (vector -> matrix).

Value parameters

fileName

the name of the file storing the dataset

name

the name of the dataset

xcols

the desired column numbers to take for predictors

ycol

the desired column number to take for the response

Attributes

Graph
Supertypes
trait Serializable
trait Product
trait Equals
class Object
trait Matchable
class Any
Show all

Members list

Value members

Concrete methods

def preProcess(tab: LTable): LTable

Pre-process the raw data that may contain missing values, strings, dates, id columns, zero-variance columns, and outliers. Return the cleaned-up data.

Pre-process the raw data that may contain missing values, strings, dates, id columns, zero-variance columns, and outliers. Return the cleaned-up data.

Value parameters

tab

the table/linked-relation containing the raw data

Attributes

def show(tab: LTable = ...): Unit

Show/print the table/linked-relation containing the data.

Show/print the table/linked-relation containing the data.

Value parameters

tab

the table/linked-relation containing the data

Attributes

Inherited methods

def productElementNames: Iterator[String]

An iterator over the names of all the elements of this product.

An iterator over the names of all the elements of this product.

Attributes

Inherited from:
Product
def productIterator: Iterator[Any]

An iterator over all the elements of this product.

An iterator over all the elements of this product.

Attributes

Returns

in the default implementation, an Iterator[Any]

Inherited from:
Product

Concrete fields

val data: LTable
val fname: Array[String]
val ofname: Array[String]
val ox: MatrixD
val rname: String
val x: MatrixD
val xy: MatrixD
val y: VectorD