Incomplete information system and rough set theory. An incremental approach to attribute reduction from dynamic incomplete decision systems in rough set theory. Incompleteness in xml was addressed recently 4, 7, 15, and one issue that required a complete reworking was the concept of certain answers for queries that return xml documents i. A rough setbased incremental approach for learning knowledge in.
And uncertainty measure is to reflect the uncertainty of an information system. A relative tolerance relation of rough set for incomplete. The traditional rough set theory is a powerful tool to deal with complete information system, and its performance to process incomplete information system is weak, m. Compute the number of supporting objects for each reduct after combining the identical. Journal of chemical and pharmaceutical research, 2014, 66.
An algorithm to solve attribute reduction in an incomplete decision table is designed. We study the method of multisource fusion in incomplete multisource systems. Choi kaist logic and set theory october 7, 2012 1 26. Research article a modified rough set approach to incomplete information systems. The original rough set theory 1, 2 deals with precise. Evidence theory is an effective method to deal with uncertainty information.
Data mining with rough set using mapreduce prachi patil student of me computer. Big data with rough set using map reduce authorstream. Attribute reduction is one of the most important problems in rough set theory. From a practical point of view, it is a good tool for data analysis. Grzymalabusse, on the unknown attribute values in learning from examples, in. The minimal reducts for the incomplete information system iis are as follows. Uncertainty measure based on evidence theory scientific. The aim of this paper is to present a dominancebased fuzzy rough set approach to incomplete intervalvalued information systems. Rough set theory rst is an extension of set theory for study of the intelligent systems characterized by insuf. Summary we explore faq frequently asked questions retrieval by applying hierarchical agglomerative clustering method and rough set theory.
Rough set theory, originated by pawlak 20,21, has become a. To merge these notions into a joint theory that combines their mutual strengths has been the object of a. Rough set theory overlaps with many other theories such that fuzzy sets, evidence theory, and statistics. This fundamental insight into mechanism design with incomplete information has allowed many allocation problems to be analyzed and forms the. Pdf multigranulation decisiontheoretic rough sets in. Theoretical study on a new information entropy and its use.
Based on different types of rough set models, the book presents the practical approaches to compute several reducts in terms of these models. Big data with rough set using map reduce authorstream presentation. Dynamic faq retrieval with rough set theory dengyiv chiu, peishin chen, and yachen pan faculty of information management, chung hua university, hsinchu, taiwan 300, r. A modified rough set approach to incomplete information systems. In this context, this papers proposal aims to address the limitations of rough set theory. Rough set theory can be considered as a tool to study the uncertain, indescendent data by classifying the set into ternary. Multigranulations rough set method of attribute reduction in. Rough set theory rst, first introduced by pawlak 1,2, is a powerful mathematical tool to. In early eighties, pawlak 22 introduced the theory of rough sets as an extension of set theory for the study of intelligent systems characterized by insufficient and incomplete information 22, 23, 26. Knowledge acquisition in incomplete information systems. One moment he was one person, an instant later he was another. Zheng classification with missing feature values using. We shall elaborate on the reasons for it later in the paper.
Information retrieval, machine learning, and data mining. A modified rough set approach to incomplete information. In this paper it is applied a rough set approach that takes into account an incomplete information system to study the steadystate security of an electric power system. On the unknown attribute values in learning from examples. Models and attribute reductions covers theoretical study of generalizations of rough set model in various incomplete information systems. Information attribute reduction based on the rough set theory. An incremental approach to attribute reduction from. Attribute reduction based on rough set theory starts from an information system that contains data about the objects of interest, which are characterized by a finite set of attributes. Algorithms of minimal mutual compatible granules and.
Roughsetbased decision model for incomplete information systems. Rough set theory is one of the best methods to process this kind of data. The attribute sets along with the objects in an information system. O is a nonempty finite set of objects at is a nonempty finite set of attributes, such that for any a. A granular computing approach to decision analysis using rough set theory iftikhar u. After giving definitions and concepts of knowledge dependency and knowledge dependency degree for incomplete information system in tolerance rough set model by distinguishing decision attribute containing missing attribute value or not, the result of maintaining reflectivity, transitivity, augmentation, decomposition law and merge law for. Nb note bene it is almost never necessary in a mathematical proof to remember that a function is literally a set of ordered pairs. A novel threeway decision model based on incomplete. Knowledge reduction algorithm the rough set theory is a mathematical tool which can quantitatively analyze the imprecise, inconsistent and incomplete information and knowledge. Moreover, concepts of lower and upper approximations are studied as well as their properties. Situation theory barwise and perry 1983, devlin i 991 provides formal mechanisms by way of constraints between situation types that made it possible to merge. Combining both kinds of data can result in an improve.
Multigranulation decisiontheoretic rough sets in incomplete information systems article pdf available in international journal of machine learning and cybernetics 66 august 2015 with 165 reads. Roughsetbased decision model for incomplete information. Knowledge dependency degree in trsm and its application to. So far, although some achievements based on divided and conquer method in the rough set theory have been acquired, the systematic methods for knowledge reduction based on divide and conquer method are still absent. Rough set theory is an extension of set theory which proposed by pawlak 1991 for describe and classify the incomplete or insufficient information. This paper discusses and proposes a rough set model for an incomplete information system, which defines an extended tolerance relation using frequency of attribute values in such a system. U, g x, f denote the value that x holds on feature f.
He was choked with indignation and sorrow, as though his good qualities had been stripped from him by a rough hand, like medals. Pdf a new rough set approach to knowledge discovery in. As such, organizations would benefit from partitioning the electorate to not duplicate. Multigranulation rough sets in incomplete information system. Zheng and wang developed a rough set and rule tree based incremental knowledge. There have been efforts in studying incomplete information systems for data classification which are based on the extensions of rough set theory. A noisetolerant approach to fuzzyrough feature selection. Rough set theory, proposed by pawlak in the early 1980s 24, 25, is a mathematical tool to. Combining system theory, process mining and fuzzy logic authors. If we wish to understand how it is organized, we could begin by looking at the melody, which seems to naturally break. How important is it to be able to easily compare and merge. Accordingly, an elementary set is any set of objects that are not different, a sharp precise set any union of some elementary sets, otherwise the set is rough imprecise, vague.
After nearly thirty years of development, rough set theory has been widely used in the fields of. Two kinds of partitions, lower and upper approximations, are then formed for the mining of certain and association rules in incomplete decision tables. Extended tolerance relation to define a new rough set. Decision rough set models as new research areas have already commenced to become new attractive topics 17. The paper introduces a rough set model to analyze an information system in which some conditions and decision data are. Pdf the original rough set model is concerned primarily with the approximation of sets described by single binary relation on the universe. Extended tolerance relation to define a new rough set model in. The rough set theory has been conceived as a tool to conceptualize, organize and analyze various types of data, in particular, to deal with inexact, uncertain or vague knowledge.
Theory behind shotgun sequencing haemophilus influenzae 1. Globalization has created new trends such as market consolidation, vertical market strategies and mergers in the business world. This relationship of not distinguishing is a mathematical basis for the theory of rough sets. Incomplete information system and rough set theory models and. A fuzzy dominance relation which aims to describe the degree of dominance in terms of pairs of objects is proposed. Procedia apa bibtex chicago endnote harvard json mla ris xml iso 690 pdf downloads 350. It first discusses some rough set extensions in incomplete information systems. This is due to the volume, complexity, and heterogeneity of such datasets, as well as fundamental gaps in our knowledge of highdimensional processes where distance measures degenerate curse of dimensionality 1, 2. The concept of similarity classes in incomplete information systems is first proposed. Feature subset selection using rough sets for high. The discretization algorithm for rough data and its. Clark ross consider and play the opening to schoenbergs three piano pieces, op. The book is intended for researchers and postgraduate students in machine learning, data mining and knowledge discovery, especially for those who are working in rough set theory, and granular computing. Rough set extensions in incomplete information systems.
A novel threeway decision model based on incomplete information system. It discusses not only the regular attributes but also the criteria in the incomplete information systems. The indiscernibility relation is a fundamental concept of the rough set theory. Was he only a set of reflectionspancakelike specters with shifting featuresstaring at one another from ghostly mirrors. At, where is called the domain of an attribute a, is called an information vector of x any attribute domain v. Knowledge reduction based on divide and conquer method in. A granular computing approach to decision analysis using. Pdf evolutionary computation for rough set models in. Despite the importance of theory, questions relating to its form and structure are neglected in comparison with questions relating to epistemology. Analysis of an incomplete information system using the. The divide and conquer method is a typical granular computing method using multiple levels of abstraction and granulations. This paper discusses the information fusion and uncertainty measure based on rough set theory. Next, probability of matching is defined from data in information systems and then measures the degree of tolerance. Y ang, expansions of rough sets in incomplete information systems, incomplete information system and rough set theory, science press beijing and springerv erlag berlin.
Finding frequent items in probabilistic data proceedings. By analyzing existing extended models and technical methods of rough set theory, the strategy of model extension is found to be suitable for processing incomplete information systems instead of filling possible values for missing attributes. Computing statistical information on probabilistic data has attracted a lot of attention recently, as the data generated from a wide range of data sources are inherently fuzzy or uncertain. Importance of diffing and merging for design specifications documentation.
The discretization algorithm for rough data and its application to intrusion detection. Incomplete information and certain answers in general data. However, classical rough set theory cannot cope with the incomplete information systems where some attribute values are missing. A novel decisionmaking approach to fund investments based on. By combining these properties, one can construct distinct rough set models. Rough set theory is an extension of set theory which proposed by pawlak 1991 for. Introduction rough set theory rst for short 1 is put forward by pawlak in 1982, which, as an generalization of set theory for. An overview of dna sequencing michigan state university. We can combine the rules which have the same generalized decisions as. Some of them may be shared outside the team, but only in a processed, noneditable form pdf, all the docs are assumed being able to be exported to this format. There is no unifying theory, single method, or unique set of tools for big data science. Choi department of mathematical science kaist, daejeon, south korea fall semester, 2012 s. Zheng classification system with missing feature values in tcm can be viewed as a 3tuple s.
The essay addresses issues of causality, explanation, prediction, and generalizati on that underlie an understanding of theory. Next, probability of matching is defined from data in information systems and. This paper deals with knowledge acquisition in incomplete information systems using rough set theory. Parallel computation of rough set approximations in information. An extended rough set model for generalized incomplete. Rough set approaches to incomplete information systems. Sikder department of computer and information science, cleveland state university, cleveland, oh 44115, usa abstract this paper presents a granular computing approach to decision analysis using rough set theory and its variable precision extension. Logic and information stanford encyclopedia of philosophy. In this paper, we study an important statistical query on probabilistic data. All eight possible extended rough set models in incomplete information systems are proposed.
Thus we want to merge evidence theory with uncertainty method in order to measure the roughness of a rough approximation space. It merges these values together to create a possibly smaller. Besides it is mathematical tool that overcome the uncertainties and doubts. Database integration is a growing and increasingly important field in both research and in dustry. Rough set approach to incomplete information systems. Therefore, it is necessary to develop a theory which enables classifications of. Where m index termsalgorithm, incomplete information system, minimal granule, multigranulation, rough set model.
1104 795 1509 1390 795 1153 233 413 944 1276 598 1154 1257 238 82 870 562 590 720 1516 802 1379 114 803 619 1065 278 1110 1318 170 559 2 654 975 1114 453 1417 491 856 516 1327 507 803