Class S3DataSpec
- All Implemented Interfaces:
Serializable, Cloneable
Describes the data specification of a DataSource.
Constructor Summary
Constructors
S3DataSpec()
Method Summary
Modifier and Type | Method | Description
S3DataSpec | clone() |
boolean | equals(Object obj) |
String | getDataLocationS3() | The location of the data file(s) used by a DataSource.
String | getDataRearrangement() | Describes the splitting requirement of a DataSource.
String | getDataSchema() | A JSON string that represents the schema for an Amazon S3 DataSource.
String | getDataSchemaLocationS3() | Describes the schema location in Amazon S3.
int | hashCode() |
void | setDataLocationS3(String dataLocationS3) | The location of the data file(s) used by a DataSource.
void | setDataRearrangement(String dataRearrangement) | Describes the splitting requirement of a DataSource.
void | setDataSchema(String dataSchema) | A JSON string that represents the schema for an Amazon S3 DataSource.
void | setDataSchemaLocationS3(String dataSchemaLocationS3) | Describes the schema location in Amazon S3.
String | toString() | Returns a string representation of this object; useful for testing and debugging.
S3DataSpec | withDataLocationS3(String dataLocationS3) | The location of the data file(s) used by a DataSource.
S3DataSpec | withDataRearrangement(String dataRearrangement) | Describes the splitting requirement of a DataSource.
S3DataSpec | withDataSchema(String dataSchema) | A JSON string that represents the schema for an Amazon S3 DataSource.
S3DataSpec | withDataSchemaLocationS3(String dataSchemaLocationS3) | Describes the schema location in Amazon S3.
-
Constructor Details
-
S3DataSpec
public S3DataSpec()
-
-
Method Details
-
setDataLocationS3
The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
- Parameters:
dataLocationS3 - The URI of a data file, or of an Amazon S3 directory or bucket containing data files, used by a DataSource.
-
getDataLocationS3
The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
- Returns:
- The URI of a data file, or of an Amazon S3 directory or bucket containing data files, used by a DataSource.
-
withDataLocationS3
The location of the data file(s) used by a DataSource. The URI specifies a data file or an Amazon Simple Storage Service (Amazon S3) directory or bucket containing data files.
- Parameters:
dataLocationS3 - The URI of a data file, or of an Amazon S3 directory or bucket containing data files, used by a DataSource.
- Returns:
- A reference to this object so that method calls can be chained together.
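The difference between the set and with forms can be sketched as follows. This is a minimal illustration rather than the SDK source: SpecSketch is a hypothetical stand-in that mirrors only two of the properties listed in the method summary, and the s3:// URIs are placeholders.

```java
// Hypothetical stand-in for S3DataSpec: it mirrors only the set/get/with
// accessor pattern for two properties and is not the SDK class itself.
final class SpecSketch {
    private String dataLocationS3;
    private String dataSchemaLocationS3;

    public void setDataLocationS3(String dataLocationS3) {
        this.dataLocationS3 = dataLocationS3;
    }

    public String getDataLocationS3() {
        return dataLocationS3;
    }

    // Same effect as the setter, but returns this so that calls can be chained.
    public SpecSketch withDataLocationS3(String dataLocationS3) {
        setDataLocationS3(dataLocationS3);
        return this;
    }

    public void setDataSchemaLocationS3(String dataSchemaLocationS3) {
        this.dataSchemaLocationS3 = dataSchemaLocationS3;
    }

    public String getDataSchemaLocationS3() {
        return dataSchemaLocationS3;
    }

    public SpecSketch withDataSchemaLocationS3(String dataSchemaLocationS3) {
        setDataSchemaLocationS3(dataSchemaLocationS3);
        return this;
    }

    public static void main(String[] args) {
        // Setter style: one statement per property.
        SpecSketch viaSetters = new SpecSketch();
        viaSetters.setDataLocationS3("s3://example-bucket/data/");
        viaSetters.setDataSchemaLocationS3("s3://example-bucket/data.schema");

        // Fluent style: each with method returns the same instance.
        SpecSketch viaWith = new SpecSketch()
                .withDataLocationS3("s3://example-bucket/data/")
                .withDataSchemaLocationS3("s3://example-bucket/data.schema");

        // Both styles leave the object in the same state.
        System.out.println(viaSetters.getDataLocationS3().equals(viaWith.getDataLocationS3()));
    }
}
```

Both styles produce an object in the same state; the with form only saves the intermediate variable when several properties are set at once.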
-
setDataRearrangement
Describes the splitting requirement of a DataSource.
- Parameters:
dataRearrangement - Describes the splitting requirement of a DataSource.
-
getDataRearrangement
Describes the splitting requirement of a DataSource.
- Returns:
- The splitting requirement of a DataSource.
-
withDataRearrangement
Describes the splitting requirement of a DataSource.
- Parameters:
dataRearrangement - Describes the splitting requirement of a DataSource.
- Returns:
- A reference to this object so that method calls can be chained together.
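This page does not show what a dataRearrangement string looks like. The sketch below assumes the JSON shape used by Amazon Machine Learning for splitting, an object with a splitting key holding percentBegin and percentEnd; treat that shape as an assumption to verify against the service documentation. RearrangementSketch and its splitting helper are hypothetical names introduced only for illustration.

```java
// Hypothetical helper, not part of the SDK. It assumes the splitting
// requirement is expressed as JSON with percentBegin and percentEnd keys.
final class RearrangementSketch {

    // Builds a rearrangement string selecting the given percentage range of the data.
    static String splitting(int percentBegin, int percentEnd) {
        if (percentBegin < 0 || percentEnd > 100 || percentBegin >= percentEnd) {
            throw new IllegalArgumentException(
                    "expected 0 <= percentBegin < percentEnd <= 100");
        }
        return "{\"splitting\":{\"percentBegin\":" + percentBegin
                + ",\"percentEnd\":" + percentEnd + "}}";
    }

    public static void main(String[] args) {
        // For example, a 70/30 split into two complementary ranges.
        System.out.println(splitting(0, 70));
        System.out.println(splitting(70, 100));
    }
}
```

A string built this way would be passed to setDataRearrangement or withDataRearrangement, one string per DataSource.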
-
setDataSchema
A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.
Define your DataSchema as a series of key-value pairs. attributes and excludedVariableNames each take an array as their value. Use the following format to define your DataSchema.
{
  "version": "1.0",
  "recordAnnotationFieldName": "F1",
  "recordWeightFieldName": "F2",
  "targetFieldName": "F3",
  "dataFormat": "CSV",
  "dataFileContainsHeader": true,
  "attributes": [
    { "fieldName": "F1", "fieldType": "TEXT" },
    { "fieldName": "F2", "fieldType": "NUMERIC" },
    { "fieldName": "F3", "fieldType": "CATEGORICAL" },
    { "fieldName": "F4", "fieldType": "NUMERIC" },
    { "fieldName": "F5", "fieldType": "CATEGORICAL" },
    { "fieldName": "F6", "fieldType": "TEXT" },
    { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
    { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
  ],
  "excludedVariableNames": [ "F6" ]
}
- Parameters:
dataSchema - A JSON string that represents the schema for an Amazon S3 DataSource, in the format shown above.
-
getDataSchema
A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.
Define your DataSchema as a series of key-value pairs. attributes and excludedVariableNames each take an array as their value. Use the following format to define your DataSchema.
{
  "version": "1.0",
  "recordAnnotationFieldName": "F1",
  "recordWeightFieldName": "F2",
  "targetFieldName": "F3",
  "dataFormat": "CSV",
  "dataFileContainsHeader": true,
  "attributes": [
    { "fieldName": "F1", "fieldType": "TEXT" },
    { "fieldName": "F2", "fieldType": "NUMERIC" },
    { "fieldName": "F3", "fieldType": "CATEGORICAL" },
    { "fieldName": "F4", "fieldType": "NUMERIC" },
    { "fieldName": "F5", "fieldType": "CATEGORICAL" },
    { "fieldName": "F6", "fieldType": "TEXT" },
    { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
    { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
  ],
  "excludedVariableNames": [ "F6" ]
}
- Returns:
- A JSON string that represents the schema for an Amazon S3 DataSource, in the format shown above.
-
withDataSchema
A JSON string that represents the schema for an Amazon S3 DataSource. The DataSchema defines the structure of the observation data in the data file(s) referenced in the DataSource.
Define your DataSchema as a series of key-value pairs. attributes and excludedVariableNames each take an array as their value. Use the following format to define your DataSchema.
{
  "version": "1.0",
  "recordAnnotationFieldName": "F1",
  "recordWeightFieldName": "F2",
  "targetFieldName": "F3",
  "dataFormat": "CSV",
  "dataFileContainsHeader": true,
  "attributes": [
    { "fieldName": "F1", "fieldType": "TEXT" },
    { "fieldName": "F2", "fieldType": "NUMERIC" },
    { "fieldName": "F3", "fieldType": "CATEGORICAL" },
    { "fieldName": "F4", "fieldType": "NUMERIC" },
    { "fieldName": "F5", "fieldType": "CATEGORICAL" },
    { "fieldName": "F6", "fieldType": "TEXT" },
    { "fieldName": "F7", "fieldType": "WEIGHTED_INT_SEQUENCE" },
    { "fieldName": "F8", "fieldType": "WEIGHTED_STRING_SEQUENCE" }
  ],
  "excludedVariableNames": [ "F6" ]
}
- Parameters:
dataSchema - A JSON string that represents the schema for an Amazon S3 DataSource, in the format shown above.
- Returns:
- A reference to this object so that method calls can be chained together.
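Because the DataSchema is passed as a plain String, it has to be assembled before it is handed to setDataSchema or withDataSchema. The following is a minimal sketch of one way to do that without a JSON library. DataSchemaSketch and build are hypothetical names; the sketch emits only a subset of the keys in the format above, assumes field names need no JSON escaping, and adds its own sanity check that the target field is among the attributes. Whether the service accepts a schema that omits the other keys is not stated on this page.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of the SDK: assembles a DataSchema string
// with a subset of the keys shown in the format above.
final class DataSchemaSketch {

    // attributes maps each field name to its field type, in column order.
    static String build(String targetFieldName, Map<String, String> attributes) {
        // Sanity check chosen for this sketch, not a documented service rule.
        if (!attributes.containsKey(targetFieldName)) {
            throw new IllegalArgumentException("target field is not listed in attributes");
        }
        StringBuilder json = new StringBuilder();
        json.append("{\"version\":\"1.0\",")
            .append("\"targetFieldName\":\"").append(targetFieldName).append("\",")
            .append("\"dataFormat\":\"CSV\",")
            .append("\"dataFileContainsHeader\":true,")
            .append("\"attributes\":[");
        boolean first = true;
        for (Map.Entry<String, String> attribute : attributes.entrySet()) {
            if (!first) {
                json.append(',');
            }
            json.append("{\"fieldName\":\"").append(attribute.getKey())
                .append("\",\"fieldType\":\"").append(attribute.getValue()).append("\"}");
            first = false;
        }
        return json.append("]}").toString();
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps the attributes in insertion order.
        Map<String, String> attributes = new LinkedHashMap<>();
        attributes.put("F2", "NUMERIC");
        attributes.put("F3", "CATEGORICAL");
        System.out.println(build("F3", attributes));
    }
}
```

For anything beyond a fixed set of simple field names, a JSON library that handles escaping is the safer choice.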
-
setDataSchemaLocationS3
Describes the schema location in Amazon S3.
- Parameters:
dataSchemaLocationS3 - Describes the schema location in Amazon S3.
-
getDataSchemaLocationS3
Describes the schema location in Amazon S3.
- Returns:
- The schema location in Amazon S3.
-
withDataSchemaLocationS3
Describes the schema location in Amazon S3.
- Parameters:
dataSchemaLocationS3 - Describes the schema location in Amazon S3.
- Returns:
- A reference to this object so that method calls can be chained together.
-
toString
Returns a string representation of this object; useful for testing and debugging.
-
equals
-
hashCode
-
clone
-