MAS Manage

At what point is it necessary to partition a MAS Manage workload across more than one MAS instance?

A new MAS instance is required to run MAS Manage workloads at the point in which the DB server can no longer be scaled up. When the DB server can no longer be scaled up, the customer should plan to create a new MAS instance and move sites to the new MAS instance which will be using a new DB server.

Maximo Transaction latency

When describing Maximo transaction latency it is important to define the boundaries of what constitutes a standard or out-of-the-box Maximo transaction. The description below does just that.

Terminology

  • CRUD refers to create, update, delete operations
  • MBO stands for Maximo Business Objects (which are hierarchical in nature)
  • Transaction latency is defined as the elapsed time between when the Maximo server receives the transaction request to the time when the Maximo server has sent the response. For UI users this also includes the elapsed time between when the Save button is clicked to when control is returned to the user.

Definition of a transaction

An out of the box Maximo transaction is expected to complete with a latency of 2 seconds or less, where a transaction is defined as the creation, update, or deletion of a single MBO, containing no more than one child object and with no attachments or binary data (blobs). Example include, but are not limited to:

  1. Creation of a single WorkOrder object. This includes generation of the WorkOrderStatus and WorkOrderAncestor records.
  2. Update of a single WorkOrder object. For example, changing states from Approved to Closed.
  3. Deletion of a single WorkOrder object.

Definition restrictions

The following conditions are considered to be outside the scope of an out of the box Maximo transaction, and therefore do not fall under the 2 second latency characterization.

  1. For UI initiated transactions this does not include latency incurred from downloading UI resources (e.g. js, css, png, jpg, etc.)
  2. It applies to out of the box Maximo applications, but not customized applications or out of the box Maximo applications with automation scripts
  3. It does not apply to UI initiated transactions with a large number of xhr requests (or portlets), for example the Maximo start center. Large here means greater than 2 xhr requests per page. Note, Maximo currently supports HTTP/1.1, so xhr requests initiated from a single user UI page are sequential, not concurrent.
  4. It does not apply to customized saved queries.
  5. It does not apply to bulk load requests.
  6. It does not apply to report related transactions

App Server

MAS Manage has different bundle types e.g. All, UI, MEA, Report and CRON to configure app server. Adjust the resource settings like cpu, memory, replic to match the workload. The settings are in ManageWorkspaces CR. Below is the sample.

apiVersion: apps.mas.ibm.com/v1
kind: ManageWorkspace
...
spec:
 settings:
    deployment:
      serverBundles:
        - bundleType: mea
          isDefault: false
          isMobileTarget: false
          isUserSyncTarget: true
          name: mea
          replica: 1
          routeSubDomain: all
        - bundleType: cron
          isDefault: false
          isMobileTarget: false
          isUserSyncTarget: false
          name: cron
          replica: 1
...
spec:
    settings:
        resources:
            manageAdmin:
                limits:
                cpu: '2'
                memory: 4Gi
                requests:
                cpu: '0.2'
                memory: 500Mi
            serverBundles:
                limits:
                cpu: '6'
                memory: 10Gi
                requests:
                cpu: '0.2'
                memory: 1Gi

Load-Balancer

Lab test shows roundrobin has more stable and better performance than leastconn policy which is the default. Follow this link to update load balancer policy.

Manage Pod Functionality

Follow this link to understand the manage pod functionality.

LTPA timeout

Using IBM Maximo Application Suite (MAS), Manage users will receive an error message saying to reload the application after 2 hours, even while actively working. This 2-hour timeout default is when the LTPA token in Manage expires, and is redirecting the user back to the login page for MAS. Follow Updating LTPA timeout in Manage to increase the default value.

WebSphere Liberty

Due to the architecture change, Maximo 8.x (MAS Manage app) is deployed on WebSphere Liberty Base 21.0.0.5 with OpenJ9. As of WebSphere Liberty 18.0.0.1, the thread growth algorithm is enhanced to grow thread capacity more aggressively to react more rapidly to peak loads. For many environments, this autonomic tuning provided by the Open Liberty thread pool works well with no configuration or tuning by the operator. If necessary, you can adjust coreThreads and maxThreads value by tuning liberty.

Configure JVM options in Manage app

Follow this link to configure JVM options

DB

Disk

disk performance is critial for db performance. Recommend a storage or disk with

  • disk throughput: > 250 MB/s
  • IOPS: 10 IOPS/GB to 100 IOPS/GB (depending on volume size)

To measure disk performance on Linux use the dd command. The sample command below measures disk performance of the data volume inside a db2 pod running in OCP

CAUTION

Make sure that ddtest filename is appended to the end of the data path or the dd command will wipe the db2 data directory.

[db2inst1@c-db2wh-manage-db2u-0 - Db2U bludata0]$ dd if=/dev/zero of=path_of_db2_data_directory/ddtest bs=128K count=8192
8192+0 records in
8192+0 records out
1073741824 bytes (1.1 GB, 1.0 GiB) copied, 2.84314 s, 378 MB/s
[db2inst1@c-db2wh-manage-db2u-0 - Db2U bludata0]$

Use the sar command line tool to check for disk performance. The example below shows that disk dev8-64 is reporting as 100% utilized and with a svctm of 677.48 milliseconds.

$ sar -d 1

Average:          DEV       tps     rkB/s     wkB/s   areq-sz    aqu-sz     await     svctm     %util
Average:    dev202-16      2.29     50.81      0.00     22.23      0.00      0.81      0.17      0.04
Average:    dev202-112      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
Average:     dev202-0     71.62      0.19   2717.52     37.95      0.26      3.64      0.15      1.09
Average:    dev202-32     26.67      0.00    257.90      9.67      0.03      1.17      0.20      0.52
Average:     dev253-0     59.57      0.00    257.90      4.33      0.10      1.64      0.10      0.59
Average:       dev8-0      0.05      0.00      0.00      0.00      0.00      0.00      1.00      0.00
Average:      dev8-16      0.05      0.00      0.00      0.00      0.00      0.00      1.00      0.00
Average:     dev253-1      0.00      0.00      0.00      0.00      0.00      0.00      0.00      0.00
Average:      dev8-32      0.05      0.00      0.00      0.00      0.00      1.00      1.00      0.00
Average:     dev253-2      2.57      0.00     10.86      4.22      0.00      1.04      0.98      0.25
Average:      dev8-48      2.62      0.00     10.86      4.15      0.00      0.96    381.85    100.01
Average:      dev8-64      1.48      0.00     40.19     27.23      0.00      1.06    677.48    100.01
Average:     dev253-3      1.43      0.00     40.19     28.13      0.00      1.20      0.73      0.10
Average:      dev8-80      0.05      0.00      0.00      0.00      0.00      1.00      1.00      0.00

Network latency between app and db server

Reducing network latency is key to optimizing performance. Confirm latency is below 50ms by conducting a ping test. For production env, strongly recommend keeping the latency below 10ms and having app and db server in the same network segment. In cloud deployment scenarios, ensure both the database and OpenShift cluster are located within the same region, with the possibility of being in the same availability zone (AZ). Utilize the ping command to evaluate and pinpoint latency issues.

Large table optimization

When optimizing large tables in the Manage app, it is recommended to transfer these tables to a dedicated tablespace on high-throughput disks, coupled with a dedicated buffer cache for enhanced performance. The speed of the disks and the availability of memory play crucial roles in this optimization strategy. Additionally, ensure that index statistics are regularly updated, and address any problematic queries to further optimize the system.

DB - DB2/DB2wh

DB2 Tuning in Maximo 7.6.x Best practice is applicable.

IMPORTANT

The containerized DB2U and DB2WH deployments do NOT support text search (Regular DB2 has text search).
As a result, some queries may perform poorly on containerized DB2 relative to Oracle DB and SQL Server, which both support text search.

Searching records by Description on the list page is a typical scenario whose performance can benefit from text search capability of the database, especially if no other indexed attributes are included in the query.

Adding a non-unique index on Description can help if an exact search can be made (Maximo search type = EXACT or user types is "=" before the search string, eg =Text) or the search can be done based on the beginning of the string (user types '%' at end of the string, eg Text%). If possible, adding other fields to the query (either by user typing them or as part of the default, where those attributes are part of an index can also help. In addition, adding Description to the end of one of these indexes can also show improvement.

Highlights:

  • increase maxsequence cache to 50
  • run runstats and/or reorg to update index periodically
  • separate system storage, user storage, backup storage, transaction logs storage, temporary tablespace storage on different disks if possible.
  • Use DB2 Performance Diagnosis to troubleshoot and tuning the db and SQL.
  • Manage requires row-organized tables. Check db2w db setting (by default it uses column based) and update the setting by db2 update db cfg using DFT_TABLE_ORG ROW
  • Manage does NOT support MMP or table partition in the current version, but consider to archive records over 1-year old. Optim is the one of the tools can be used for archiving. see this guide and this video for details.
  • Increase the concurrently running statements allowed for a DB2 application. This issue occcurs when loading a large amount of data via MIF or api call. See this link for the tuning.
  • storageclass
    • for ibm cloud: performance block storage (custom or gold, depending on volume size), block silver for backup
    • for aws cloud: if using EFS for db, consider Provisioned mode to have a constant throughput. For more disk options, see details in this page
  • For db2 registry (db2set):
    • Set db2_workload=maximo. That makes db cfg variable WLM_ADMISSION_CTRL is set to NO
    • Do NOT change the default values for DB2_OVERRIDE_NUM_CPUS and DB2_OVERRIDE_THREADING_DEGREE.
  • Verify db2 db cfg variable WLM_ADMISSION_CTRL is set to NO
  • For db2ucluster CR
    • Do NOT set db2 instance memory. The operator will automatically calculate it based on the container memory limit.
    • (Optional) for performance stability, set the same value to both container resource request and limit.
  • For db2 monitor switches
    • The best practice is turn off all monitor switches except the Timestamp in dbm cfg.
    • If turn on monitor switches in dbm cfg, will cause monitor switches are turned on by default for all DB2 sessions, this will bring 5%-10% overhead on the overall database performance, depends on the workload and database server hardware spec. So we should not turn on monitor switches in dbm cfg.
    • When we need to take DB2 monitor data, we should only turn on monitor switches in a specific session by the following command:
      db2 update monitor switches using BUFFERPOOL on LOCK on SORT on STATEMENT on TIMESTAMP on TABLE on UOW on
      And turn off all monitor switches immediately after getting required monitor data by the following command:
      db2 update monitor switches using BUFFERPOOL off LOCK off SORT off STATEMENT off TIMESTAMP off TABLE off UOW off

DB - Oracle

DB - MSSQL

  • Maximo 7.6.x Best practice is applicable

  • additional settings for MSSQL Server 2019

    • compatibility level: if maximo db is upgraded from the old version and the performance degradation is observed after the upgrade, consider to set compatibility level to the old version to keep the execution plan same.
    • isolation level:
              ALTER DATABASE <DB NAME>  
              SET ALLOW_SNAPSHOT_ISOLATION ON  
      
              ALTER DATABASE <DB NAME> 
              SET READ_COMMITTED_SNAPSHOT ON  
      
  • recommend to place tempdb on dedicated fast disks.

Note:

  • It is highly recommended to refresh index statistics after migrating Maximo from version 7 to version 8. In some cases, rebuilding indexes may also be necessary.