PMML Features and Attributes

Using External Models With Vertica gives an overview of the features Vertica supports for working with external models. This topic describes the limitations of Vertica's PMML support in more detail.

With PMML models, Vertica currently supports only:

  • PMML models that do not contain a data preprocessing step.
  • PMML models that encode only these model types: k-means, linear regression, logistic regression, random forest (classifier and regressor), GeneralRegressionModel, and TreeModel.
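The two restrictions above can be checked before importing a model. The following is a minimal sketch, not a Vertica tool: the function name check_pmml is hypothetical, and it assumes that a "data preprocessing step" appears as a non-empty TransformationDictionary or LocalTransformations element and that the supported model types correspond to the ClusteringModel, RegressionModel, GeneralRegressionModel, TreeModel, and MiningModel tags listed in the table later in this topic.

```python
import xml.etree.ElementTree as ET

# Assumption: these model tags correspond to the supported model types.
SUPPORTED_MODELS = {
    "ClusteringModel", "RegressionModel", "GeneralRegressionModel",
    "TreeModel", "MiningModel",
}
# Children of the PMML root that are not models.
NON_MODEL_CHILDREN = {
    "Header", "MiningBuildTask", "DataDictionary",
    "TransformationDictionary", "Extension",
}
# Assumption: a non-empty element with one of these tags is a preprocessing step.
PREPROCESSING_TAGS = {"TransformationDictionary", "LocalTransformations"}


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def check_pmml(xml_text):
    """Return a list of problems found in a PMML document (empty if none)."""
    root = ET.fromstring(xml_text)
    if local_name(root.tag) != "PMML":
        return ["root element is not PMML"]

    problems = []
    # Look anywhere in the document for a non-empty preprocessing element.
    for elem in root.iter():
        name = local_name(elem.tag)
        if name in PREPROCESSING_TAGS and len(elem) > 0:
            problems.append(f"preprocessing step found: {name}")
    # Every direct child of the root that is not a known non-model tag
    # is treated as a model element.
    for child in root:
        name = local_name(child.tag)
        if name not in NON_MODEL_CHILDREN and name not in SUPPORTED_MODELS:
            problems.append(f"unsupported model type: {name}")
    return problems
```

An empty result means only that the document passes these two coarse checks. It does not verify the attribute-level restrictions listed in the table.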

Supported PMML Attributes

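The following table lists the supported PMML tags and, for each tag, its ignored, supported, and unsupported attributes and sub-tags. A hyphen (-) marks an empty cell.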

XML-tag name ignored attributes supported attributes unsupported attributes ignored sub-tags supported sub-tags unsupported sub-tags
Categories - - - - Category Extension
Category - value (required) - - - Extension
CategoricalPredictor -
  • name (required)
  • value (required)
  • coefficient (required)
- - - Extension
Cluster size
  • id
  • name
-
  • KohonenMap
  • Covariances
NUM-ARRAY
ClusteringField -
  • field (required)
  • isCenterField (only "true" is supported)
  • compareFunction
  • fieldWeight
  • similarityScale
- -
ClusteringModel modelName
  • functionName (required, only "clustering" is supported)
  • algorithmName
  • modelClass (required, only "centerBased" is supported)
  • numberOfClusters (required)
  • isScorable (only "true" is supported)
- ModelVerification
  • MiningSchema
  • ComparisonMeasure
  • ClusteringField
  • Cluster
  • Extension
  • Output
  • ModelStats
  • ModelExplanation
  • LocalTransformations
  • MissingValueWeights
  • ModelVerification
ComparisonMeasure
  • minimum
  • maximum
  • kind (required, only "distance" is supported)
  • compareFunction
- -
  • euclidean
  • squaredEuclidean
  • Extension
  • chebychev
  • cityBlock
  • minkowski
  • simpleMatching
  • jaccard
  • tanimoto
  • binarySimilarity
CompoundPredicate - booleanOperator (required) - - Extension
CovariateList - - - - Predictor Extension
DataDictionary - numberOfFields - - DataField
DataField displayName
  • name (required)
  • optype (required)
  • dataType (required)
  • taxonomy
  • isCyclic
- Value
DecisionTree - - - -
FactorList - - - - Predictor Extension
False - - - - - Extension
GeneralRegressionModel
  • modelName
  • targetVariableName
  • startTimeVariable
  • subjectIDVariable
  • modelType (required)
  • functionName (required)
  • algorithmName
  • targetReferenceCategory
  • cumulativeLink
  • linkFunction
  • linkParameter
  • trialsVariable
  • trialsValue
  • distribution
  • distParameter
  • offsetVariable
  • offsetValue
  • modelDF
  • isScorable (only "true" is supported)
  • endTimeVariable
  • statusVariable
  • baselineStrataVariable
Header
  • copyright
  • description
  • modelVersion
- -
  • Extension
  • Application
  • Annotation
  • Timestamp
- -
MiningField
  • importance
  • missingValueTreatment
  • name (required)
  • usageType
  • optype
  • outliers
  • lowValue
  • highValue
  • missingValueReplacement
  • invalidValueTreatment
- - Extension
MiningModel
  • modelName
  • algorithmName
  • functionName (required)
  • isScorable (only "true" is supported)
- ModelVerification
MiningSchema - - - - MiningField Extension
Node -
  • id
  • score
  • recordCount
  • defaultChild
- -
NumericPredictor -
  • name (required)
  • exponent
  • coefficient (required)
- - - Extension
Output - - - - OutputField Extension
OutputField
  • displayName
  • optype
  • name (required)
  • dataType (required)
  • feature
  • value
  • isFinalResult
  • targetField
  • ruleFeature
  • algorithm
  • rankBasis
  • segmentId
  • rank
  • rankOrder
  • isMultiValued
- -
Parameter -
  • name (required)
  • label
referencePoint - - Extension
ParameterList - - - - Parameter Extension
ParamMatrix - - - - PCell Extension
PCell -
  • parameterName (required)
  • targetCategory
  • beta (required)
  • df
- - - Extension
PPCell -
  • parameterName (required)
  • predictorName (required)
  • targetCategory
- - - Extension
PPMatrix - - - - PPCell Extension
PMML -
  • version (required)
  • xmlns
- MiningBuildTask
  • Header
  • DataDictionary
  • ClusteringModel
  • RegressionModel
  • TransformationDictionary
  • Extension
  • any unsupported model type
Predictor -
  • name (required)
  • contrastMatrixType
- - Extension
RegressionModel
  • modelName
  • targetFieldName
  • modelType
  • functionName (required)
  • algorithmName
  • normalizationMethod
  • isScorable (only "true" is supported)
- ModelVerification
  • MiningSchema
  • RegressionTable
  • Extension
  • Output
  • ModelStats
  • ModelExplanation
  • LocalTransformations
  • Targets
  • ModelVerification
RegressionTable -
  • intercept (required)
  • targetCategory
- -
  • CategoricalPredictor
  • NumericPredictor
  • Extension
  • CategoricalPredictor
  • PredictorTerm
Segment - id weight -
Segmentation
  • missingPredictionTreatment
  • missingThreshold
multipleModelMethod (required) - - Segment Extension
SimplePredicate -
  • field (required)
  • operator (required)
  • value
- - - Extension
SimpleSetPredicate -
  • field (required)
  • booleanOperator (required)
- - ARRAY Extension
TreeModel
  • functionName (required)
  • algorithmName
  • noTrueChildStrategy
  • splitCharacteristic
  • isScorable (only "true" is supported)
- ModelVerification
True - - - - - Extension
ScoreDistribution -
  • value (required)
  • recordCount (required)
- - - Extension
Value displayValue
  • value (required)
  • property
- - - Extension
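
Several rows in the table accept only one value for an attribute, for example isScorable (only "true") or modelClass (only "centerBased"). The following is a minimal sketch of how those restrictions could be checked in a PMML document. The function name find_restricted_values and the ONLY_SUPPORTED mapping are hypothetical. The mapping is transcribed from the table, with the attribute spelled isScorable as in the PMML schema, and only attributes that are explicitly set in the document are checked.

```python
import xml.etree.ElementTree as ET

# (tag, attribute) -> the single value the table marks as supported.
ONLY_SUPPORTED = {
    ("ClusteringModel", "functionName"): "clustering",
    ("ClusteringModel", "modelClass"): "centerBased",
    ("ClusteringModel", "isScorable"): "true",
    ("ClusteringField", "isCenterField"): "true",
    ("ComparisonMeasure", "kind"): "distance",
    ("GeneralRegressionModel", "isScorable"): "true",
    ("MiningModel", "isScorable"): "true",
    ("RegressionModel", "isScorable"): "true",
    ("TreeModel", "isScorable"): "true",
}


def find_restricted_values(xml_text):
    """Return (tag, attribute, value) triples whose value is not the supported one."""
    violations = []
    for elem in ET.fromstring(xml_text).iter():
        tag = elem.tag.rsplit("}", 1)[-1]  # drop the XML namespace, if any
        for attr, value in elem.attrib.items():
            allowed = ONLY_SUPPORTED.get((tag, attr))
            # Attributes that are absent are never visited, so defaults pass.
            if allowed is not None and value != allowed:
                violations.append((tag, attr, value))
    return violations
```

Each returned triple names the tag, the attribute, and the offending value, so the report can be matched against the corresponding table row.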