JPA View
Purpose#
The JPA View is component responsible for creating read-projections of tasks and business data entries. It currently implements Datapool View API and Taskpool API and persists the projection as entities and relations in a RDBMS using JPA. It is useful if the JPA persistence is already used in the project setup.
Features#
- stores representation of business data entries
- stores representation of process definitions
- stores representation of process instances
- provides single query API supporting single and subscription queries
Configuration options#
In order to activate the JPA View implementation, please include the following dependency on your classpath:
<dependency>
<groupId>io.holunda.polyflow</groupId>
<artifactId>polyflow-view-jpa</artifactId>
<version>${polyflow.version}</version>
</dependency>
The implementation relies on Spring Data JPA and needs to activate those.
@Configuration
@EnablePolyflowJpaView
public class MyViewConfiguration {
}
In addition, configure a database connection to database using application.properties
or application.yaml
:
spring:
jpa:
show-sql: false
open-in-view: true # disable JPA warning
datasource:
url: <jdbc-connnection-string>
username: <db-user>
password: <db-password>
The JPA view uses a special facility for creating search indexes on unstructured payload. For this purpose
it converts the payload into a recursive map structure (in which every primitive type is a leaf and every
complex type is decomposed via the map) using Jackson ObjectMapper and then create search indexes for all
property paths (myObj1.myProperty2.myOtherEmbeddedProperty3
) and their values. You can provide some
configuration of this indexing process by the following configuration options:
polyflow.view.jpa:
stored-items: task, data-entry, process-instance, process-definition
payload-attribute-level-limit: 2
payload-attribute-column-length: 255
include-correlated-data-entries-in-data-entry-queries: false
process-outdated-events: false
data-entry-filters:
include: myProperty2.myOtherEmbeddedProperty3, myProperty2.myOtherEmbeddedProperty2
# exclude: myProperty
task-filters:
exclude: processVariableWithVeryLongText
In the example above you see the configuration of the limit of keying depth and usage of include/exclude filters of the keys.
In addition, the stored-items
property is holding a set of items to be persisted to the database. The possible values of
stored items are: task
, data-entry
, process-instance
and process-definition
. By setting this property, you can disable
storage of items not required by your application and save space consumption of your database. The property defaults to data-entry
.
With the payload-attribute-column-length
property one can specify a maximum length for payload attribute values if they are strings. Values that exceed
this length will automatically be trimmed to the max length in order to prevent exceptions when handling the event. This is especially necessary because
relational databases have limits on the length of composite primary keys. Since the combination of (id, path, value) for tasks or (id, type, path, value) for
data entries must be unique, the primary key is very large, which limits the amount of space available for the value.
The include-correlated-data-entries-in-data-entry-queries
flag controls whether a data entry query (DataEntriesForUserQuery
or DataEntriesQuery
) considers
the payload of correlated data entries. The data entry attributes (such as entry_type
, state.state
, ...) of correlated data entries are not considered.
Note: Only one level of correlation depth is considered here and there is no option yet to change the depth.
With the property process-outdated-events
you can configure the view such that all events are processed, even when the event timestamp is older than a different event
that was already processed. This might be helpful when doing technical updates but should be used with care as old event will override more recent changes if the
order is not guaranteed. Defaults to false
.
The attributes data-entry-filters
and task-filters
hold include
/ exclude
lists of property paths which will be taken in
consideration during the search index creation.
Note
Please make sure you understand that the payload enrichment performed during collection and indexing for search are two different operations. It is perfectly fine to have a large JSON payload attached to the task, but it makes no sense to make the entire payload searchable, at lease using JPA View.
Entity Scan#
The JPA View utilizes Spring Data repositories and Hibernate entities inside the persistence layer. As a result, it declares a @EntityScan
and @EnableJpaRepositories
annotations pointing at the corresponding locations. If you are using Spring Data JPA on your own, you will
need to add the @EntityScan
and @EnableJpaRepositores
annotation pointing at your packages. In addition, please check
Persistence configuration.
Logging#
The view implementation provides runtime details using standard logging facility. If you
want to increase the logging level, please setup it e.g. in your application.yaml
:
logging.level:
io.holunda.polyflow.view.jpa: DEBUG
DB Tables#
The JPA View uses several tables to store the results. These are:
PLF_DATA_ENTRY
: table for business data entriesPLF_DATA_ENTRY_AUTHORIZATIONS
: table for authorization information of data entriesPLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES
: table for data entry attribute search indexPLF_DATA_ENTRY_PROTOCOL
: table for data entry protocol entry (users, groups)PLF_PROC_DEF
: table for process definitionsPLF_PROC_DEF_AUTHORIZATIONS
: table for authorization information of process definitionsPLF_PROC_INSTANCE
: table for process instancesPLF_TASK
: table for user tasksPLF_TASK_AUTHORIZATIONS
: table for authorization information of user tasksPLF_TASK_CORRELATIONS
: table for user task correlation informationPLF_TASK_PAYLOAD_ATTRIBUTES
: table for user task attribute search indexPLF_VIEW_TASK_AND_DATA_ENTRY_PAYLOAD
: view for convenient taskWithDataEntry queries executionPLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES
: view for convenient data entry queries with correlationsTRACKING_TOKEN
: table for Axon Tracking Tokens
If you are interested in DDLs for the view, feel free to generate one using the following call of Apache Maven
mvn -Pgenerate-sql -f view/jpa
. Currently, DDLs for the databases H2, MSSQL and PostgreSQL are generated into target/
directory.
The DDL for the PLF_VIEW_TASK_AND_DATA_ENTRY_PAYLOAD
and PLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES
cannot be auto-generated, therefore you need to use the following statements to create them:
create view PLF_VIEW_TASK_AND_DATA_ENTRY_PAYLOAD as
((select pc.TASK_ID, dea.PATH, dea.VALUE
from PLF_TASK_CORRELATIONS pc
join PLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES dea on pc.ENTRY_ID = dea.ENTRY_ID and pc.ENTRY_TYPE = dea.ENTRY_TYPE)
union
select * from PLF_TASK_PAYLOAD_ATTRIBUTES);
create view PLF_VIEW_DATA_ENTRY_PAYLOAD as (
select *
from PLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES
union
(select ec.OWNING_ENTRY_ID as ENTRY_ID,
ec.OWNING_ENTRY_TYPE as ENTRY_TYPE,
ep.path as PATH,
ep.value as VALUE
from PLF_DATA_ENTRY_CORRELATIONS ec
join PLF_DATA_ENTRY_PAYLOAD_ATTRIBUTES ep
on
ec.ENTRY_ID = ep.ENTRY_ID and ec.ENTRY_TYPE = ep.ENTRY_TYPE)
)