Seems you have not registered as a member of localhost.saystem.shop!

You may have to register before you can download all our books and magazines, click the sign up button below to create a free account.

Sign up

Data Profiling
  • Language: en
  • Pages: 136

Data Profiling

Data profiling refers to the activity of collecting data about data, {i.e.}, metadata. Most IT professionals and researchers who work with data have engaged in data profiling, at least informally, to understand and explore an unfamiliar dataset or to determine whether a new dataset is appropriate for a particular task at hand. Data profiling results are also important in a variety of other situations, including query optimization, data integration, and data cleaning. Simple metadata are statistics, such as the number of rows and columns, schema and datatype information, the number of distinct values, statistical value distributions, and the number of null or empty values in each column. More...

Proceedings of the 7th Ph.D. Retreat of the HPI Research School on Service-oriented Systems Engineering
  • Language: en
  • Pages: 218

Proceedings of the 7th Ph.D. Retreat of the HPI Research School on Service-oriented Systems Engineering

Design and Implementation of service-oriented architectures imposes a huge number of research questions from the fields of software engineering, system analysis and modeling, adaptability, and application integration. Component orientation and web services are two approaches for design and realization of complex web-based system. Both approaches allow for dynamic application adaptation as well as integration of enterprise application. Commonly used technologies, such as J2EE and .NET, form de facto standards for the realization of complex distributed systems. Evolution of component systems has lead to web services and service-based architectures. This has been manifested in a multitude of in...

Joint Workshop of the German Research Training Groups in Computer Science
  • Language: en
  • Pages: 261

Joint Workshop of the German Research Training Groups in Computer Science

description not available right now.

Advancing the Discovery of Unique Column Combinations
  • Language: en
  • Pages: 30

Advancing the Discovery of Unique Column Combinations

Unique column combinations of a relational database table are sets of columns that contain only unique values. Discovering such combinations is a fundamental research problem and has many different data management and knowledge discovery applications. Existing discovery algorithms are either brute force or have a high memory load and can thus be applied only to small datasets or samples. In this paper, the wellknown GORDIAN algorithm and "Apriori-based" algorithms are compared and analyzed for further optimization. We greatly improve the Apriori algorithms through efficient candidate generation and statistics-based pruning methods. A hybrid solution HCAGORDIAN combines the advantages of GORDIAN and our new algorithm HCA, and it significantly outperforms all previous work in many situations.

Machine Learning and Knowledge Discovery in Databases. Research Track
  • Language: en
  • Pages: 514

Machine Learning and Knowledge Discovery in Databases. Research Track

description not available right now.

The Semantic Web: ESWC 2014 Satellite Events
  • Language: en
  • Pages: 538

The Semantic Web: ESWC 2014 Satellite Events

  • Type: Book
  • -
  • Published: 2014-10-15
  • -
  • Publisher: Springer

This book constitutes the thoroughly refereed post-conference proceedings of the Satellite Events of the 11th International Conference on the Semantic Web, ESWC 2014, held in Anissaras, Crete, Greece, in May 2014. The volume contains 20 poster and 43 demonstration papers, selected from 113 submissions, as well as 12 best workshop papers selected from 60 papers presented at the workshop at ESWC 2014. Best two papers from AI Mashup Challenge are also included. The papers cover various aspects of the Semantic Web.

Modeling and enacting complex data dependencies in business processes
  • Language: en
  • Pages: 52

Modeling and enacting complex data dependencies in business processes

Enacting business processes in process engines requires the coverage of control flow, resource assignments, and process data. While the first two aspects are well supported in current process engines, data dependencies need to be added and maintained manually by a process engineer. Thus, this task is error-prone and time-consuming. In this report, we address the problem of modeling processes with complex data dependencies, e.g., m:n relationships, and their automatic enactment from process models. First, we extend BPMN data objects with few annotations to allow data dependency handling as well as data instance differentiation. Second, we introduce a pattern-based approach to derive SQL queries from process models utilizing the above mentioned extensions. Therewith, we allow automatic enactment of data-aware BPMN process models. We implemented our approach for the Activiti process engine to show applicability.

The Four Generations of Entity Resolution
  • Language: en
  • Pages: 152

The Four Generations of Entity Resolution

Entity Resolution (ER) lies at the core of data integration and cleaning and, thus, a bulk of the research examines ways for improving its effectiveness and time efficiency. The initial ER methods primarily target Veracity in the context of structured (relational) data that are described by a schema of well-known quality and meaning. To achieve high effectiveness, they leverage schema, expert, and/or external knowledge. Part of these methods are extended to address Volume, processing large datasets through multi-core or massive parallelization approaches, such as the MapReduce paradigm. However, these early schema-based approaches are inapplicable to Web Data, which abound in voluminous, noi...

Data Exploration Using Example-Based Methods
  • Language: en
  • Pages: 146

Data Exploration Using Example-Based Methods

Data usually comes in a plethora of formats and dimensions, rendering the exploration and information extraction processes challenging. Thus, being able to perform exploratory analyses in the data with the intent of having an immediate glimpse on some of the data properties is becoming crucial. Exploratory analyses should be simple enough to avoid complicate declarative languages (such as SQL) and mechanisms, and at the same time retain the flexibility and expressiveness of such languages. Recently, we have witnessed a rediscovery of the so-called example-based methods, in which the user, or the analyst, circumvents query languages by using examples as input. An example is a representative o...