kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_RESOLVER Class Reference

Inheritance diagram for kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_RESOLVER:

kddml.Operators.Preprocessing.PP_MARK_DUPLICATES_RESOLVER
kddml.Operators.Preprocessing.InstanceLevelDependentTransformation
kddml.Operators.Preprocessing.PreprocessingResolver
kddml.Operators.OperatorResolver
kddml.Operators.HWResourcesDescription

Protected Member Functions

Instances[] runCore (Instances tuple, Instances metatuple) throws ResolverException, KDDMLCoreException
String getHistoryDescription ()

Detailed Description

The operator finds and merges duplicate instances. Two instances are duplicates if they agree on a key, which is a list of attributes. For example, if the key consists of the attributes temperature and outlook, two instances are duplicates when they have the same values for both attributes. When two or more instances are identified as duplicates, the operator keeps only one of them as the representative and removes all the others from the input preprocessing table. Finally, all key attributes of the representative are marked with a specified value, i.e. a string is added to the preprocessing information.
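The duplicate detection described above can be sketched in plain Java. This is a minimal illustration, not the KDDML implementation: the class `MergeDuplicatesSketch`, the method `representatives`, and the row representation as `Map<String, String>` are hypothetical, and the choice of the first instance seen as the representative is an assumption, since the description does not say which duplicate is kept.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MergeDuplicatesSketch {

    /**
     * Returns the indices of the rows that survive the merge.
     * Assumption: the first row seen for each distinct key is the representative.
     */
    public static List<Integer> representatives(List<Map<String, String>> rows,
                                                List<String> keyAttributes) {
        // Maps each distinct key (the values of the key attributes) to the first row index.
        Map<List<String>, Integer> firstSeen = new LinkedHashMap<>();
        for (int i = 0; i < rows.size(); i++) {
            List<String> key = new ArrayList<>();
            for (String attribute : keyAttributes) {
                key.add(rows.get(i).get(attribute));
            }
            firstSeen.putIfAbsent(key, i); // later duplicates are ignored
        }
        return new ArrayList<>(firstSeen.values());
    }

    public static void main(String[] args) {
        List<Map<String, String>> rows = List.of(
            Map.of("outlook", "sunny", "temperature", "hot", "play", "no"),
            Map.of("outlook", "sunny", "temperature", "hot", "play", "yes"),
            Map.of("outlook", "rainy", "temperature", "mild", "play", "yes"));
        // Rows 0 and 1 share the key (hot, sunny), so only row 0 is kept.
        System.out.println(representatives(rows, List.of("temperature", "outlook"))); // prints [0, 2]
    }
}
```

Grouping by the list of key values keeps the sketch independent of how many attributes make up the key.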

Title: KDDML

Description: Knowledge Discovery in Database Environment

Copyright: Copyright (c) 2003-2005

Company: Universita' di Pisa - Dipartimento di Informatica

Author:
Andrea Romei (romei@di.unipi.it)

Sandra Zimei

Version:
2.0.16


Member Function Documentation

Instances[] kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_RESOLVER.runCore (Instances tuple, Instances metatuple) throws ResolverException, KDDMLCoreException [protected, virtual]

Core operator method. Given the physical tuples and the related preprocessing tuples as weka.core.Instances, the operator returns the modified instances as a two-element array.

Parameters:
tuple Instances the entire input dataset as weka.core.Instances
metatuple Instances the entire input preprocessing dataset as weka.core.Instances. The number of metatuples equals the number of physical tuples.
Returns:
Instances[] a two-element array containing the computed output instances. The first element contains the physical instances; the second element contains the related preprocessing instances. The output schema of the instances must be compatible with the input schemata, and the number of output instances must coincide with the number of input instances.
Exceptions:
ResolverException if a resolving error occurs.
KDDMLCoreException if a core-level error occurs.

Reimplemented from kddml.Operators.Preprocessing.PP_MARK_DUPLICATES_RESOLVER.
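The return convention of runCore (physical instances first, preprocessing instances second) can be illustrated with a small self-contained sketch. Everything here is hypothetical: `Table` is a simplified stand-in for weka.core.Instances, `keepAndMark` is not a KDDML method, and the ";" separator used when appending the mark is an assumption about how the string is added to the preprocessing information.

```java
import java.util.ArrayList;
import java.util.List;

public class RunCoreContractSketch {

    /** Hypothetical stand-in for weka.core.Instances: a list of rows of string values. */
    public static final class Table {
        public final List<String[]> rows = new ArrayList<>();
    }

    /**
     * Keeps the rows at the given indices in both tables and appends a mark to the
     * key columns of the kept preprocessing rows.
     * Returns a two-element array: {physical table, preprocessing table}.
     */
    public static Table[] keepAndMark(Table tuple, Table metatuple, List<Integer> kept,
                                      int[] keyColumns, String mark) {
        // The two inputs are parallel: row i of metatuple describes row i of tuple.
        if (tuple.rows.size() != metatuple.rows.size()) {
            throw new IllegalArgumentException("tuple and metatuple must have the same number of rows");
        }
        Table physical = new Table();
        Table preprocessing = new Table();
        for (int index : kept) {
            physical.rows.add(tuple.rows.get(index).clone());
            String[] meta = metatuple.rows.get(index).clone();
            for (int column : keyColumns) {
                // Assumed format: marks are accumulated, separated by ';'.
                meta[column] = meta[column].isEmpty() ? mark : meta[column] + ";" + mark;
            }
            preprocessing.rows.add(meta);
        }
        return new Table[] { physical, preprocessing };
    }
}
```

Copying the rows instead of editing them in place leaves the input tables untouched, which makes the sketch easier to reason about.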

String kddml.Operators.Preprocessing.PP_MERGE_DUPLICATES_RESOLVER.getHistoryDescription () [protected, virtual]

Returns a description of the actions performed by this preprocessing operator. This description will be reported in the history related to the preprocessing data source.

Returns:
String
Exceptions:
KDDMLCoreException 

Reimplemented from kddml.Operators.Preprocessing.PP_MARK_DUPLICATES_RESOLVER.


Generated on Thu Feb 23 13:04:53 2006 for kddml by  doxygen 1.4.3