Ontology buildingOntology design: Best practices and anti-patterns

Ontology design: Best practices and anti-patterns

Best practices

A well-designed Ontology creates a unified, intuitive representation of your organization that enables seamless data integration, cross-functional collaboration, and powerful analytics. The following principles establish the foundation for effective Ontology design:

Model reality, not systems: Object types should represent real-world entities, not individual source system or department representations.
Curate intentionally: Every property should have clear business or technical value.
Collaborate across teams: Ontology design should involve stakeholders from multiple departments or teams.
Keep object types focused: Each object type should represent one distinct entity.
Choose the right tool: Use action types for human or agentic decisions, pipelines for automated transformations.
Use interfaces for abstraction: When entities share common characteristics, model the abstraction with interfaces rather than creating wide, sparse object types.
Document your decisions: Document object types, properties, and links in Ontology Manager.

Anti-patterns

Even experienced Ontology designers can fall into common design traps that seem reasonable initially but create significant problems as the Ontology grows. This section identifies recurring anti-patterns, explains why they occur, and provides concrete guidance for avoiding or resolving them.

Avoiding these anti-patterns will help you build an Ontology that accurately represents your business domain, reduces maintenance overhead, and enables powerful cross-functional workflows.

Anti-pattern	Description	Solution
System Silos	Creating separate object types for each source system.	Merge data in pipelines; create unified object types.
The Kitchen Sink	Including unnecessary technical columns as properties.	Curate properties intentionally; exclude ETL metadata.
Department Silos	Each department creates their own version of shared entities.	Create shared object types; use properties and links for department-specific data.
The God Object	One object type represents multiple distinct entities.	Create distinct object types; use interfaces for shared characteristics.
The Golden Hammer	Using action types for everything, including automated transformations.	Use pipelines for deterministic transformations; reserve actions for human decisions.
Action Sprawl	Creating many single-property actions instead of cohesive business operations.	Design actions around business operations that bundle related changes into meaningful workflows.
The Time Machine	Modeling historical versions as separate objects or object types.	Use a single object per entity with linked history/amendment objects and time series properties.
The Misnomer	Using vague, generic, or misleading names for Ontology elements.	Use specific, descriptive names; qualify ambiguous properties; name links by relationship.

Anti-pattern: System Silos

System Silos occur when you create separate object types for the same real-world entity based on the source system the data originates from, rather than modeling the entity itself.

Common causes

Different teams own different source systems and build independently
Uncertainty about how to merge data from multiple sources
Desire to preserve system-specific fields without deciding what's essential

Example

Your organization has employee data in three systems: an HR system, a badge access system, and a project management tool. Instead of creating a single Employee object type, you create:

HR System Employee
Badge System Employee
Project Management Employee

Problems

Problem	Impact
Fragmented view of reality	End users cannot see a unified view of an employee; they must navigate multiple object types to understand the full picture.
Duplicated effort	Action types, link types, and applications must be built multiple times for what is conceptually the same entity.
Inconsistent data	The same employee may have conflicting information across object types with no clear source of truth.
Complex maintenance	Changes to business logic must be replicated across all system-specific object types.

Solution

Create a single object type representing the real-world entity and use data pipelines to merge information from multiple source systems into a unified backing dataset.

✗ Avoid                          ✓ Prefer
─────────────────────────────    ─────────────────────────────
HR System Employee               Employee
Badge System Employee         →  (backed by merged dataset
Project Management Employee      from all three systems)

To implement this:

Identify the primary key that uniquely identifies the entity across systems (for example, employee ID).
Build a transform that joins data from all source systems.
Define clear precedence rules for conflicting values (for example, HR system is authoritative for job title).
Create a single object type backed by the merged dataset.

Anti-pattern: The Kitchen Sink

This anti-pattern (also known as "everything but the kitchen sink") occurs when object types include unnecessary columns from external systems that have no business relevance in the Ontology context, cluttering the data model with technical artifacts.

Common causes

"Just in case" mentality (keeping fields that might be useful later)
Lack of clarity on what fields are meaningful
Direct mapping from source systems without curation
Fear of losing data by excluding columns

Example

When creating a Customer object type from a CRM system integration, you include all available columns:

customer_id ✓
customer_name ✓
email ✓
_crm_extracted_at ✗
_crm_received_at ✗
_crm_batched_at ✗
_crm_sequence ✗
_crm_table_version ✗
_crm_internal_record_id ✗
last_etl_update_timestamp ✗

Problems

Problem	Impact
Confusion	End users see irrelevant technical fields alongside business data.
Performance degradation	Unnecessary properties increase data scale, compute, index size, and slow down searches.
Obscured insights	Important business properties are buried among system metadata.

Solution

Curate properties intentionally. Only include columns that have clear business meaning and will be useful for workflows.

Use these guidelines when deciding which properties to include:

Include	Exclude
Business identifiers (customer ID, order number)	Pipeline metadata
Human-readable attributes (name, description)	Internal system IDs with no business meaning
Dates relevant to business processes	Timestamps only relevant to data engineering
Status fields needed for filtering or actions	Audit columns for pipeline debugging

To implement this:

Review each column and ask: "Would someone ever need to see, search, or filter by this?"
Keep technical metadata in the backing dataset for debugging, but do not expose it as properties.
Use property visibility settings to hide any borderline properties that must exist but are rarely needed.
Document why each property exists and who uses it.

Anti-pattern: Department Silos

Department Silos occur when different departments create their own versions of the same object type, leading to a fragmented Ontology that mirrors organizational structure rather than business reality.

Common causes

Departments work in isolation without cross-functional coordination
Each team believes their view of the customer is unique
Lack of governance or central Ontology design authority
Teams want autonomy and control over "their" data

Example

Multiple departments need to work with customer data, and each creates their own object type:

Sales team creates Sales Customer
Support team creates Support Customer
Finance team creates Billing Customer
Marketing team creates Marketing Contact

All four object types represent the same real-world entity: a customer.

Problems

Problem	Impact
No single source of truth	Different departments have conflicting information about the same customer.
Impossible cross-functional workflows	Cannot easily answer questions like "Show me all interactions with this customer across sales, support, and billing".
Duplicated development	Each department builds redundant actions, links, and applications.
Governance nightmare	Data quality issues multiply; fixes in one object type do not propagate to others.

Solution

Create shared object types that serve multiple departments, using properties and links to capture department-specific information where needed.

✗ Avoid                          ✓ Prefer
─────────────────────────────    ─────────────────────────────
Sales Customer                   Customer
Support Customer           →       ├── sales_status (property)
Billing Customer                   ├── support_tier (property)
Marketing Contact                  ├── billing_account_id (property)
                                   └── Links to:
                                       ├── Sales Opportunities
                                       ├── Support Tickets
                                       └── Invoices

To implement this:

Identify entities that exist across departmental boundaries.
Establish a cross-functional working group to define shared object types.
Use properties to capture department-specific attributes on shared objects.
Use link types to connect shared objects to department-specific objects (such as Customer → Support Ticket).
Leverage object views or curated Workshop and OSDK applications if departments need different "views" of the same underlying entity.
Use restricted views if specific properties can only be accessible by a specific team.

Anti-pattern: The God Object

The God Object anti-pattern occurs when a single object type is overloaded to represent multiple distinct real-world entities, resulting in a bloated, confusing, and unmaintainable object type.

Common causes

Over-abstraction driven by superficial similarities ("they are all assets")
Desire to minimize the number of object types
Lack of clear entity definitions before building
Scope creep as more use cases are added to an existing object type

Indicators

An object type has many properties that are frequently null
Property meanings change based on another property's value (such as type or category)
You find yourself asking "What kind of [Object] is this?" when viewing an object
Business rules and validations require extensive conditional logic based on object "type"

Example

You create an Asset object type intended to represent "anything valuable," which ends up including:

Physical equipment (trucks, machinery)
Software licenses
Real estate properties
Financial instruments
Employees (as "human assets")

The object type has 150+ properties, most of which are null for any given object, and the meaning of properties like value, location, and status varies completely depending on what kind of "asset" the object represents.

Problems

Problem	Impact
Semantic confusion	End users cannot understand what an `Asset` actually represents.
Sparse data	Most properties are null for most objects, making the data hard to interpret.
Impossible validation	Cannot enforce business rules because rules differ by entity type.
Poor search experience	Searching for `Assets` returns a mix of unrelated things.
Action type complexity	Actions must handle wildly different entity types with complex conditional logic.

Solution

Create distinct object types for distinct real-world entities. Use interfaces to model shared characteristics when entities genuinely share common properties or behaviors.

✗ Avoid                          ✓ Prefer
─────────────────────────────    ─────────────────────────────
Asset                            Equipment
  - asset_type                   Vehicle
  - asset_subtype                Software License
  - value                  →     Property (Real Estate)
  - location                     Financial Instrument
  - status
  - 145 more properties...       Interface: Depreciable Asset
                                   - purchase_date
                                   - purchase_value
                                   - depreciation_schedule

To implement this:

List the distinct real-world entities currently represented by the object type.
Create separate object types for each distinct entity.
Identify genuinely shared properties and behaviors.
Use interfaces to model shared characteristics across object types.
Migrate existing objects to appropriate new object types.

Anti-pattern: The Golden Hammer

The Golden Hammer anti-pattern occurs when you over-rely on a single tool (for example, action types) to solve every problem, even when other approaches are more appropriate. The name comes from the saying: "If all you have is a hammer, everything looks like a nail."

Common causes

Action types are well-understood and visible
Desire to give end users "control" over when calculations happen
Lack of familiarity with pipeline/transform capabilities
Ontology-first thinking without considering the full platform

Example

You need to calculate aggregate metrics for a dashboard showing total sales by region. Instead of using a data pipeline to pre-compute these metrics:

You create an action type called Calculate Regional Sales Totals
End users must manually trigger the calculation with a button
Results are written back to objects via the action

Similarly, instead of building a pipeline that cleanses data during ingestion for data quality, you create a Fix Data Quality Issues action type that end users must run manually.

Problems

Problem	Impact
Scalability limits	Action types have execution limits; pipelines can process millions of rows.
Unnecessary burden	End users must remember to trigger actions that should happen automatically.
Stale data	Derived data becomes outdated between manual action executions.
Performance issues	Real-time calculations via actions are slower than pre-computed pipeline results.

Solution

Choose the right tool for the job based on your use case:

Choose action types	Choose pipelines
A human decision is required.	Changes apply to many objects based on source system conditions.
The change is specific to one or few objects.	Changes can happen on a schedule (hourly, daily).
Input is needed.	No input is needed.
Changes should happen immediately in response to an action.

Examples of applying this guidance:

✗ Avoid                                    ✓ Prefer
────────────────────────────────────────   ────────────────────────────────────────
Action: "Calculate Regional Sales"    →    Pipeline that aggregates sales data daily
                                           Object type: Regional Sales Summary

Action: "Standardize Address Format"  →    Pipeline that cleanses addresses on ingestion

Action: "Update Inventory Status"     →    Pipeline that sets status based on quantity
(based on quantity thresholds)             thresholds during each sync

Action: "Assign Risk Score"           →    Pipeline or model that calculates risk scores
(using a formula)                          and writes to backing dataset

To implement this:

Before creating an action type, ask: "Does this require human judgment?"
Reserve action types for genuine decisions and edits.
Use scheduled pipelines for regular data updates and calculations.
Consider using Functions for complex derived calculations that need to be available in real-time.

Anti-pattern: Action Sprawl

Action Sprawl occurs when you create many narrowly-scoped action types that each modify a single property, rather than designing cohesive actions that represent meaningful business operations.

Common causes

Thinking of actions as database column updates rather than business operations
Building actions incrementally without considering the overall user experience
Lack of understanding of how actions can bundle multiple property changes
Mimicking CRUD operations from traditional application development

Indicators

More than 10 action types for a single object type
Multiple actions that are always performed in sequence
Action names that read like Set [Property] or Update [Property]
End users complaining about too many steps to complete a task

Example

For an Employee object type, instead of creating meaningful business actions, you create:

Update Employee First Name
Update Employee Last Name
Update Employee Email
Update Employee Phone
Update Employee Department
Update Employee Manager
...and 20 more single-property actions

Problems

Problem	Impact
Overwhelming experience	End users face a long, cluttered list of actions and struggle to find the right one.
Fragmented workflows	Simple updates require multiple action submissions to complete a single business task.
No cohesive business representation	Actions do not map to real-world processes, making the Ontology unintuitive.
Fragmented audit trails	History of changes is scattered across many small actions, making it difficult to understand what happened and why.

Solution

Design action types around business operations, not database updates. Create actions that bundle related changes into meaningful workflows.

✗ Avoid                                    ✓ Prefer
────────────────────────────────────────   ────────────────────────────────────────
Update Employee First Name                 Update Employee Contact Information
Update Employee Last Name            →       - first_name
Update Employee Email                        - last_name
Update Employee Phone                        - email
                                             - phone

Update Employee Department                 Transfer Employee to New Department
Update Employee Manager              →       - new_department
Update Employee Location                     - new_manager
                                             - new_location
                                             - effective_date

Create Employee Record                     Onboard New Employee
Set Employee Start Date              →       - All required fields for a new hire
Assign Employee Badge                        - Triggers downstream workflows
Assign Employee Equipment                    (badge assignment, equipment request)

To implement this:

Map out the real business processes that involve changing object data.
Group related property changes into single actions that represent those processes.
Use action parameters to allow optional fields within a cohesive action.
Name actions after the business operation: Transfer Employee, Approve Purchase Order, Escalate Support Ticket.
Use action rules and validation logic to enforce business constraints within the action.

Anti-pattern: The Time Machine

The Time Machine anti-pattern occurs when you model historical versions of an entity as separate objects or object types rather than using time series data, snapshots, or proper versioning strategies.

Common causes

Desire to preserve a complete history of every change
Misunderstanding of how to model temporal data in the Ontology
Applying file-versioning mental models (v1, v2, v3) to object design
Lack of awareness of time series properties or linked history patterns

Indicators

Object type contains multiple objects representing the same real-world entity at different points in time
Properties like version, revision, or is_current exist to distinguish copies
Object counts grow proportionally with the number of changes rather than the number of entities
End users are confused about which object to reference or link to

Example

To track changes to a Contract, you create:

Contract v1, Contract v2, Contract v3 as separate objects within the same object type
Or worse: Contract 2023, Contract 2024, Contract 2025 as separate object types for each year

Each "version" is a full copy of the contract with slightly different property values, and links to other objects (such as Vendor or Department) are duplicated across all versions.

Problems

Problem	Impact
Object count explosion	Every change creates a new object, rapidly inflating the Ontology with redundant data.
Ambiguous current state	It is difficult to identify which version is the "current" or authoritative version.
Ambiguous links	Links to contracts become unclear; which version should a `Vendor` or `Department` link to?
Complex reporting	Reporting across time periods requires filtering and deduplication logic that is error-prone.

Solution

Use a single object per entity with properties for current state. Store historical changes in a separate linked object type, enable edits history, or leverage time series properties.

✗ Avoid                                    ✓ Prefer
────────────────────────────────────────   ────────────────────────────────────────
Contract v1 (object)                       Contract (single object per contract)
Contract v2 (object)                 →       - current_value
Contract v3 (object)                         - current_status
                                             - effective_date
— OR —                                       - Links to:
                                               └── Contract Amendments
Contract 2023 (object type)                        - amendment_date
Contract 2024 (object type)                        - previous_value
Contract 2025 (object type)                        - new_value
                                                   - change_reason

To implement this:

Use a single object per real-world entity with properties reflecting the current state.
Create a separate linked object type (such as Contract Amendment or Contract History) to capture historical changes.
Leverage time series properties for values that change frequently and need temporal tracking.
Use the backing dataset or edits history to maintain full historical records for audit trails if needed.

Anti-pattern: The Misnomer

The Misnomer anti-pattern occurs when you use vague, generic, or misleading names for object types, properties, and link types that do not clearly communicate their meaning, leading to confusion and misinterpretation across the Ontology.

Common causes

Using shorthand names that make sense to you but not to others
Names are carried over directly from source system column names without translation
Desire for brevity over clarity
Lack of naming conventions or governance standards
Assumption that context will make meaning obvious

Indicators

End users frequently ask "What does this property mean?" or "What kind of [Object] is this?"
The same name could reasonably refer to multiple different concepts
Property names are single generic words like value, type, status, date, or name without qualification
Link types use generic labels like "related to" without specifying the nature of the relationship

Example

You create the following Ontology elements with ambiguous names:

Object type: Item (What kind of item? Product? Line item? Inventory item?)
Property: value (Monetary value? Quantity? Score? Rating?)
Property: type (Type of what? What are valid values?)
Property: date (Created date? Modified date? Due date? Effective date?)
Link type: Item → Related Item (How are they related? Parent-child? Substitute? Accessory?)

End users encountering these names must guess at their meaning or dig into documentation to understand what the data actually represents.

Problems

Problem	Impact
Misinterpretation	End users cannot understand the Ontology without additional context, leading to incorrect analysis and decisions.
Steep learning curve	New team members must spend significant time learning what vague names actually mean.
Documentation dependency	Documentation becomes essential rather than supplementary, and falls out of date quickly.
Cross-team confusion	Different teams interpret the same vague names differently, leading to inconsistent usage.

Solution

Use specific, descriptive names for all Ontology elements. Names should be self-documenting so that anyone can understand meaning without additional context.

✗ Avoid                                    ✓ Prefer
────────────────────────────────────────   ────────────────────────────────────────
Object type: Item                    →     Object type: Product
                                           Object type: Sales Order Line Item
                                           Object type: Warehouse Inventory Record

Property: value                      →     Property: monetary_value
                                           Property: quantity_on_hand
                                           Property: risk_score

Property: type                       →     Property: product_category
                                           Property: service_tier

Property: date                       →     Property: order_placed_date
                                           Property: contract_effective_date

Link: Item → Related Item            →     Link: Product → Purchasing Customer
                                           Link: Employee → Supervisor
                                           Link: Equipment → Manufacturing Facility

To implement this:

Establish naming conventions before building and enforce them through governance reviews.
Use specific, descriptive names: Product, Sales Order Line Item, Warehouse Inventory Record.
Qualify ambiguous properties: monetary_value, quantity_on_hand, risk_score.
Name links explaining the relationship: Purchasing Customers, Manufacturing Facility, Supervisor.
Add descriptions to all Ontology elements explaining their meaning and valid values.
Review names with end users to ensure they are intuitive and unambiguous.

Building a successful Ontology

The anti-patterns described in this guide are common but avoidable. By focusing on the fundamental best practices (modeling reality rather than systems, curating properties intentionally, collaborating across teams, and choosing the right tools for each task), you can build an Ontology that scales with your organization's needs.

Remember that effective Ontology design is iterative. Start with clear entity definitions, involve stakeholders early, and refine your model as you learn what works. When you encounter challenges, revisit the principles in this guide to identify whether an anti-pattern may be emerging and course-correct before it becomes difficult to change.

←

PREVIOUSInterfaces / Metadata reference

NEXTOntology search / Semantic search / Overview

→