Main Page | Class Hierarchy | Class List | Class Members

kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_SETTINGS Class Reference

Inheritance diagram for kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_SETTINGS:

kddml.Operators.Preprocessing.PP_MARK_DUPLICATES_SETTINGS kddml.Operators.Preprocessing.PreprocessingSettings kddml.Operators.OperatorSettings List of all members.

Public Member Functions

String getDefaultAttributeValue (String attribute_name)

Detailed Description

The operator finds and unifies instances that are duplicates. Two instances are duplicates on the basis of a key composed by a list of attributes. As an example, consider the attributes temperature and outlook as the key. In this case, two instances are duplicates if they have the same values for those attributes. When two or more instances have been selected as duplicates, the operator chooses4 only one instance as the representant. All the other instances are removed from the input preprocessing table. Finally, all key attributes of this instance are marked (i.e. a string is added to preprocessing information) with a specified value.
Attributes:

Title: KDDML

Description: Knowledge Discovery in Database Environment

Copyright: Copyright (c) 2003-2005

Company: Universita' di Pisa - Dipartimento di Informatica

Author:
Andrea Romei (romei@di.unipi.it)

Sandra Zimei

Version:
2.0.16


Member Function Documentation

String kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_SETTINGS.getDefaultAttributeValue String  attribute_name  )  [virtual]
 

Returns the default value related to a given input attribute. Returns null if the attribute do not admit a default value. An attribute can be given any legal value as a default. The attribute value is not required on each element in the document, but if it is not present, it will appear to be the specified default. If the attribute is implied and it is not included in the element, the operator assumes that this is the attribute value.

Parameters:
attribute_name String
Returns:
String

Reimplemented from kddml.Operators.Preprocessing.PP_MARK_DUPLICATES_SETTINGS.


Generated on Thu Feb 23 13:04:53 2006 for kddml by  doxygen 1.4.3