Linting Models

This guide describes how to apply optional linters to a Smithy model.

Linting overview

A linter is a special kind of model validator that is configurable. Linter implementations are found in code. The smithy-linters package in Maven Central contains several linters that can be used to apply additional validation to Smithy models.

The following example adds smithy-linters as a dependency to a Smithy project:

smithy-build.json
{
    "...": "...",
    "maven": {
        "dependencies": [
            "software.amazon.smithy:smithy-linters:1.65.0"
        ]
    },
    "...": "..."
}
build.gradle.kts
dependencies {
    ...
    implementation("software.amazon.smithy:smithy-linters:1.65.0")
}
build.gradle
dependencies {
    ...
    implementation 'software.amazon.smithy:smithy-linters:1.65.0'
}

After the dependency is added and available on the Java classpath, validators defined in the package and registered using Java SPI are available for use in Smithy models.

Linters in smithy-linters

For Java developers using Smithy's reference implementation, the following linters (except for UnreferencedShape) are defined in the software.amazon.smithy:smithy-linters Maven package.

UnreferencedShape

Emits when a shape is not connected to the rest of the model. If no configuration is provided, the linter will check if a shape is connected to the closure of any service shape. A selector can be provided to define a custom set of "root" shapes to customize how the linter determines if a shape is unreferenced. Shapes that are connected through the idRef trait are considered connected.

Rationale

Just like unused variables in code, removing unused shapes from a model makes the model easier to maintain.

Default severity

NOTE

Configuration

Property

Type

Description

rootShapeSelector

string

A selector that specifies the root shape(s) from which to detect if other shapes are connected. Defaults to "service", meaning any shape connected to any service shape in the model is considered referenced.

Example:

$version: "2"

metadata validators = [
    {
        // Find shapes that aren't connected to service shapes or shapes
        // marked with the trait, smithy.example#myCustomTrait.
        name: "UnreferencedShape"
        configuration: {
            rootShapeSelector: ":is(service, [trait|smithy.example#myCustomTrait])"
        }
    }
]

Note

For backward compatibility reasons, the UnreferencedShape validator is available in software.amazon.smithy:smithy-model Maven package and does not require additional dependencies.

AbbreviationName

Validates that shape names and member names do not represent abbreviations with all uppercase letters. For example, instead of using "XMLRequest" or "instanceID", this validator recommends using "XmlRequest" and "instanceId".

Rationale

Using a strict form of camelCase where abbreviations are written just like other words makes names more predictable and easier to work with in tooling. For example, a tool that generates code in Python might wish to represent camelCase words using snake_case; utilizing strict camel casing makes it easier to split words apart.

Default severity

DANGER

Configuration

Property

Type

Description

allowedAbbreviations

[ string ]

A case-insensitive list of abbreviations to allow to be all capital letters. Defaults to an empty list.

Example:

$version: "2"

metadata validators = [
    {name: "AbbreviationName"}
]

CamelCase

Validates that shape names and member names adhere to a consistent style of camel casing. By default, this validator will ensure that shape names use UpperCamelCase, trait shape names use lowerCamelCase, and that member names use either lowerCamelCase or UpperCamelCase (depending on which is currently more prevalent in each service closure).

Rationale

Utilizing a consistent camelCase style makes it easier to understand a model and can lead to consistent naming in code generated from Smithy models.

Default severity

DANGER

Configuration

Property

Type

Description

memberNames

string

Specifies the camelCase style of member names. Can be set to either "upper", "lower", or "auto" (the default).

Example:

$version: "2"

metadata validators = [
    {name: "CamelCase"}
]

MissingSensitiveTrait

This validator scans shape or member names and identifies ones that look like they could contain sensitive information but are not marked with the @sensitive trait. This does not apply to shapes where the @sensitive trait would be invalid. Users may also configure this validator with a custom list of terms, and choose to ignore the built-in defaults. The defaults terms include types of personal information such as 'birth day', 'billing address', 'zip code', or 'gender', as well as information that could be maliciously exploited such as 'password', 'secret key', or 'credit card'.

Rationale

Sensitive information often incurs legal requirements regarding the handling and logging of it. Mistakenly not marking sensitive data accordingly carries a large risk, and it is helpful to have an automated validator to catch instances of this rather than rely on best efforts.

Default severity

WARNING

Configuration

Property

Type

Description

terms

[ string ]

A list of search terms that match shape or member names case-insensitively based on word boundaries (for example, the term "access key id" matches "AccessKeyId", "access_key_id", and "accesskeyid"). See Words boundary matching for details.

excludeDefaults

boolean

A flag indicating whether or not to disregard the default set of terms. This property is not required and defaults to false. If set to true, terms must be provided.

Example:

$version: "2"

metadata validators = [{
    name: "MissingSensitiveTrait"
    configuration: {
        excludeDefaults: false,
        terms: ["home planet"]
    }
}]

NoninclusiveTerms

Validates that all text content in a model (i.e. shape names, member names, documentation, trait values, etc.) does not contain words that perpetuate cultural biases. This validator has a built-in set of bias terms that are commonly found in APIs along with suggested alternatives.

Noninclusive terms are case-insensitively substring matched and can have any number of leading or trailing whitespace or non-whitespace characters.

This validator has built-in mappings from noninclusive terms to match model text to suggested alternatives. The configuration allows for additional terms to suggestions mappings to either override or append the built-in mappings. If a match occurs and the suggested alternatives is empty, no suggestion is made in the generated warning message.

Rationale

Intent doesn't always match impact. The use of noninclusive language like "whitelist" and "blacklist" perpetuates bias through past association of acceptance and denial based on skin color. Other words should be used that are not only inclusive, but more clearly communicate meaning. Words like allowList and denyList much more clearly indicate that something is allowed or denied.

Default severity

WARNING

Configuration

Property

Type

Description

terms

{ keyword -> [ alternatives ] }

A set of noninclusive terms to suggestions to either override or replace the built-in mappings. This property is not required unless excludeDefaults is true. The default value is the empty set.

excludeDefaults

boolean

A flag indicating whether or not the mappings set specified by terms configuration replaces the built-in set or appends additional mappings. This property is not required and defaults to false.

Example:

$version: "2"

metadata validators = [{
    name: "NoninclusiveTerms"
    configuration: {
        excludeDefaults: false,
        terms: {
            mankind: ["humankind"],
            mailman: ["mail carrier", "postal worker"]
        }
    }
}]

ReservedWords

Validates that shape names and member names do not match a configured set of reserved words.

Rationale

Tools that generate code from Smithy models SHOULD automatically convert reserved words into symbols that are safe to use in the targeted programming language. This validator can be used to warn about these conversions as well as to prevent sensitive words, like internal code-names, from appearing in public artifacts.

Default Severity

DANGER

Configuration

A single key, reserved, is Required in the configuration. Its value is a list of objects with the following properties:

Property

Type

Description

words

[ string ]

A list of words that shape or member names MUST not case-insensitively match. Supports a leading and trailing wildcard character of "*". See Wildcards in ReservedWords for details.

terms

[ string ]

A list of search terms that match shape or member names case-insensitively based on word boundaries (for example, the term "access key id" matches "AccessKeyId", "access_key_id", and "accesskeyid"). See Words boundary matching for details.

selector

string

Specifies a selector of shapes to validate for this configuration. Defaults to validating all shapes, including member names.

Note

When evaluating member shapes, the member name will be evaluated instead of the shape name.

reason

string

A reason to display for why this set of words is reserved.

Example:

$version: "2"

metadata validators = [{
    id: "FooReservedWords"
    name: "ReservedWords"
    configuration: {
        reserved: [
            {
                words: ["Codename"]
                reason: "This is the internal project name."
            }
        ]
    }
}]

Wildcards in ReservedWords

The ReservedWords validator allows leading and trailing wildcard characters to be specified.

  • Using both a leading and trailing wildcard indicates that shape or member names match when case-insensitively containing the word. The following table shows matches for a reserved word of *codename*:

    Example

    Result

    CreateCodenameInput

    Match

    CodenameResource

    Match

    ReferencedCodename

    Match

    Codename

    Match

  • Using a leading wildcard indicates that shape or member names match when case-insensitively ending with the word. The following table shows matches for a reserved word of *codename:

    Example

    Result

    CreateCodenameInput

    No match

    CodenameResource

    No match

    ReferencedCodename

    Match

    Codename

    Match

  • Using a trailing wildcard indicates that shape or member names match when case-insensitively starting with the word. The following table shows matches for a reserved word of codename*:

    Example

    Result

    CreateCodenameInput

    No match

    CodenameResource

    Match

    ReferencedCodename

    No Match

    Codename

    Match

  • Using no wildcards indicates that shape or member names match when case-insensitively the same as the word. The following table shows matches for a reserved word of codename:

    Example

    Result

    CreateCodenameInput

    No match

    CodenameResource

    No match

    ReferencedCodename

    No match

    Codename

    Match

Words boundary matching

Word boundaries can be used to find terms of interest. Word boundary search text consists of one or more alphanumeric words separated by a single space. When comparing against another string, the contents of the string are separated into words based on word boundaries. Those words are case-insensitively compared against the words in the search text for a match.

Word boundaries are detected when the casing between two characters changes, or the type of character between two characters changes. The following table demonstrates how comparison text is parsed into words.

Comparison text

Parsed words

accessKey

access key

accessKeyID

access key id

accessKeyIDValue

access key id value

accesskeyId

accesskey id

accessKey1

access key 1

access_keyID

access key id

The following table shows matches for a search term of secret id, meaning the word "secret" needs to be followed by the word "id". Word boundary searches also match if the search terms concatenated together with no spaces is considered a word in the search text (for example, secret id will match the word secretid).

Comparison text

Result

SomeSecretId

Match

SomeSecretIDValue

Match

SomeSecret__ID__value

Match

secret_id

Match

secret_id100

Match

secretid

Match

secretid_value

Match

secretidvalue

No Match

SecretThingId

No match

SomeSecretid

No match

Syntax restrictions

  • Empty search terms are not valid.

  • Only a single space can appear between words in word boundary patterns.

  • Leading and trailing spaces are not permitted in word boundary patterns.

  • Word boundary patterns can only contain alphanumeric characters.

StandardOperationVerb

Looks at each operation shape name and determines if the first word in the operation shape name is one of the defined standard verbs or if it is a verb that has better alternatives.

Note

Operations names MUST use a verb as the first word in the shape name in order for this validator to properly function.

Rationale

Using consistent verbs for operation shape names helps consumers of the API to more easily understand the semantics of an operation.

Default severity

DANGER

Configuration

Property

Type

Description

verbs

[ string ]

The list of verbs that each operation shape name MUST start with.

prefixes

[ string ]

A list of prefixes that MAY come before one of the valid verbs. Prefixes are often used to group families of operations under a common prefix (e.g., batch might be a common prefix in some organizations). Only a single prefix is honored.

suggestAlternatives

object

Used to recommend alternative verbs. Each key is the name of a verb that should be changed, and each value is a list of suggested verbs to use instead.

Note

At least one verb or one suggestAlternatives key-value pair MUST be provided.

Example:

$version: "2"

metadata validators = [{
    name: "StandardOperationVerb"
    configuration: {
        verbs: ["Register", "Deregister", "Associate"]
        prefixes: ["Batch"]
        suggestAlternatives: {
            "Make": ["Create"]
            "Transition": ["Update"]
        }
    }
}]

RepeatedShapeName

Validates that Structure member names and Union member names do not case-insensitively repeat their container shape names.

As an example, if a structure named "Table" contained a member named "TableName", then this validator would emit a WARNING event.

Rationale

Repeating a shape name in the members of identifier of the shape is redundant.

Default severity

WARNING

Configuration

Property

Type

Description

exactMatch

boolean

If set to true, the validator will only warn if the member name is case-insensitively identical to the containing shape's name.

InputOutputStructureReuse

Validates that every operation defines a dedicated input and output shape marked with the input trait and output trait.

Rationale
  1. Using the same structure for both input and output can lead to backward-compatibility problems in the future if the members or traits used in input needs to diverge from those used in output. It is always better to use structures that are exclusively used as input or exclusively used as output.

  2. Referencing the same input or output structure from multiple operations can lead to backward-compatibility problems in the future if the inputs or outputs of the operations ever need to diverge. By using the same structure, you are unnecessarily tying the interfaces of these operations together.

Default severity

DANGER

MissingPaginatedTrait

Checks for operations that look like they should be paginated but do not have the paginated trait.

Rationale

Paginating operations that can return potentially unbounded lists of data helps to maintain a predictable SLA and helps to prevent operational issues in the future.

Default severity

DANGER

Configuration

Property

Type

Description

verbsRequirePagination

[string]

Defines the case-insensitive operation verb prefixes for operations that MUST be paginated. A DANGER event is emitted for any operation that has a shape name that starts with one of these verbs. Defaults to ["list", "search"].

verbsSuggestPagination

[string]

Defines the case-insensitive operation verb prefixes for operations that SHOULD be paginated. A WARNING event is emitted when an operation is found that matches one of these prefixes, the operation has output, and the output contains at least one top-level member that targets a List. Defaults to ["describe", "get"]

inputMembersRequirePagination

[string]

Defines the case-insensitive operation input member names that indicate that an operation MUST be paginated. A DANGER event is emitted if an operation is found to have an input member name that case-insensitively matches one of these member names. Defaults to ["maxresults", "maxitems", "pagesize", "limit", "nexttoken", "pagetoken", "token", "marker"]

outputMembersRequirePagination

[string]

Defines the case-insensitive operation output member names that indicate that an operation MUST be paginated. A DANGER event is emitted if an operation is found to have an output member name that case-insensitively matches one of these member names. Defaults to ["nexttoken", "pagetoken", "token", "marker", "nextpage", "nextpagetoken", "position", "nextmarker", "paginationtoken", "nextpagemarker"].

Example:

metadata validators = [
    {name: "MissingPaginatedTrait"}
]

ShouldHaveUsedTimestamp

Looks for shapes that likely represent time, but that do not use a timestamp shape.

The ShouldHaveUsedTimestamp validator checks the following names:

  • string shape names

  • short, integer, long, float, and double shape names

  • structure member names

  • union member names

The ShouldHaveUsedTimestamp validator checks each of the above names to see if they likely represent a time value. If a name does look like a time value, the shape or targeted shape MUST be a timestamp shape.

A name is assumed to represent a time value if it:

  • Begins or ends with the word "time"

  • Begins or ends with the word "date"

  • Ends with the word "at"

  • Ends with the word "on"

  • Ends with the word "since"

  • Contains the exact string "timestamp" or "Timestamp"

For the purpose of this validator, words are matched case insensitively. Words are separated by either an underscore character, or by mixed case characters. For example, "FooBar", "fooBar", "foo_bar", "Foo_Bar", and "FOO_BAR" all contain the same two words, "foo" and "bar".

Rationale

Smithy tooling can convert timestamp shapes into idiomatic language types that make them easier to work with in client tooling.

Default severity

DANGER

Configuration

Property

Type

Description

additionalPatterns

[ string ]

A list of regular expression patterns that identify names that represent time.

MissingClientOptionalTrait

Allows services to control backward compatibility guarantees for members marked as @required and @default by requiring the application of the @clientOptional trait.

Rationale

Different service providers have different backward compatibility guarantees for @required and @default structure members. Some services wish to reserve the right to remove the @required trait at any time, while others are able to strictly follow the backward-compatibility guarantees of the @required trait. For example, it is considered backward compatible to remove the @required trait from a member and replace it with the @default trait. However, this isn't possible for members that target structure or union shapes because they can have no default value. The risk associated with such members may be unacceptable for some services.

Default severity

DANGER

Configuration

Property

Type

Description

onRequiredOrDefault

boolean

Requires that members marked with the @required or @default trait are also marked with the @clientOptional trait.

onRequiredStructureOrUnion

boolean

Requires that @required members that target structure or union shapes are also marked with the @clientOptional trait. @required members that target structures and unions are risky because there is no backward compatible way to replace the @required trait with the @default trait if the member ever needs to be made optional.

The following example requires that @required members that target a structure or union are marked with the @clientOptional trait.

$version: "2"

metadata validators = [
    {
        name: "MissingClientOptionalTrait",

        // Limit validation to a specific set of namespaces.
        namespaces: ["smithy.example"],

        configuration: {
            onRequiredStructureOrUnion: true
        }
    }
]

This validation can be suppressed for any member that the service provider decides is not at risk of ever needing to become optional in the future:

structure Sprocket {
    @required
    @suppress(["MissingClientOptionalTrait"])
    owner: OwnerStructure
}

Writing custom validators

Custom validators can be written in Java to apply more advanced model validation. Writing a custom validator involves writing an implementation of a Smithy validator in Java, creating a JAR, and making the JAR available on the classpath.

Custom validators are implementations of the software.amazon.smithy.model.validation.Validator interface. Most validators should extend from software.amazon.smithy.model.validation.AbstractValidator.

The following linter emits a ValidationEvent for every shape in the model that is not documented.

package com.example.mypackage;

import java.util.List;
import java.util.stream.Collectors;
import software.amazon.smithy.model.Model;
import software.amazon.smithy.model.traits.DocumentationTrait;
import software.amazon.smithy.model.validation.AbstractValidator;
import software.amazon.smithy.model.validation.ValidationEvent;

public class DocumentationValidator extends AbstractValidator {
    @Override
    public List<ValidationEvent> validate(Model model) {
        return model.shapes()
                .filter(shape -> !shape.hasTrait(DocumentationTrait.class))
                .map(shape -> error(shape, "This shape is not documented!"))
                .collect(Collectors.toList());
    }
}

Validators need to be registered as Java service providers. Add the following class name to a file named software.amazon.smithy.model.validation.Validator found in the src/main/resources/META-INF/services directory of a standard Gradle Java package:

com.example.mypackage.DocumentationValidator

When added to the classpath (typically as a dependency of a published JAR), the custom validator is automatically applied to a model each time the model is loaded.

Writing custom Linters

Like custom validators, custom linters can be written in Java to apply more advanced model validation.

Custom linters are implementations of the software.amazon.smithy.model.validation.Validator interface. Because linters are configurable, they are created using an implementation of the software.amazon.smithy.model.validation.ValidatorService interface.

The following validator emits a ValidationEvent for every shape in the model that has documentation that contains a forbidden string.

package com.example.mypackage;

import java.util.List;
import java.util.Optional;
import java.util.stream.Collectors;
import java.util.stream.Stream;
import software.amazon.smithy.model.Model;
import software.amazon.smithy.model.node.NodeMapper;
import software.amazon.smithy.model.shapes.Shape;
import software.amazon.smithy.model.traits.DocumentationTrait;
import software.amazon.smithy.model.validation.AbstractValidator;
import software.amazon.smithy.model.validation.ValidationEvent;
import software.amazon.smithy.model.validation.ValidatorService;

public class ForbiddenDocumentationValidator extends AbstractValidator {

    /**
     * ForbiddenDocumentation configuration settings.
     */
    public static final class Config {
        private List<String> forbid;

        public List<String> getForbid() {
            return forbid;
        }

        public void setForbid(List<String> forbid) {
            this.forbid = forbid;
        }
    }

    // Does the actual work of converting metadata found in a Smithy
    // model into an actual implementation of a Validator.
    public static final class Provider extends ValidatorService.Provider {
        public Provider() {
            super(ForbiddenDocumentationValidator.class, configuration -> {
                // Deserialize the Node value into the Config POJO.
                NodeMapper mapper = new NodeMapper();
                ForbiddenDocumentationValidator.Config config = mapper.deserialize(configuration, Config.class);
                return new ForbiddenDocumentationValidator(config);
            });
        }
    }

    private final List<String> forbid;

    // The constructor is private since the validator is only intended to
    // be created when loading a model via the Provider class.
    private ForbiddenDocumentationValidator(Config config) {
        this.forbid = config.forbid;
    }

    @Override
    public List<ValidationEvent> validate(Model model) {
        // Find every shape that violates the linter and return a list
        // of ValidationEvents.
        return model.shapes()
                .filter(shape -> shape.hasTrait(DocumentationTrait.class))
                .flatMap(shape -> validateShape(shape).map(Stream::of).orElseGet(Stream::empty))
                .collect(Collectors.toList());
    }

    private Optional<ValidationEvent> validateShape(Shape shape) {
        // Grab the trait by type.
        DocumentationTrait trait = shape.expectTrait(DocumentationTrait.class);
        String docString = trait.getValue();

        for (String text : forbid) {
            if (docString.contains(text)) {
                // Emit an event that points at the location of the trait
                // and associates the warning with the shape.
                return Optional.of(warning(shape, trait, "Documentation uses forbidden text: " + text));
            }
        }

        return Optional.empty();
    }
}

Configurable linters need to be registered as Java service providers. Add the following class name to a file named software.amazon.smithy.model.validation.ValidatorService found in the src/main/resources/META-INF/services directory of a standard Gradle Java package:

com.example.mypackage.ForbiddenDocumentationValidator$Provider

When added to the classpath (typically as a dependency of a published JAR), the custom validator is available to be used as a validator. The following example warns each time the word "meow" appears in documentation:

$version: "2"

metadata validators = [
    {
        name: "ForbiddenDocumentation"
        configuration: {
            forbid: ["meow"]
        }
    }
]

Tip

The EmitEachSelector can get you pretty far without needing to write any Java code. For example, the above linter can be implemented using the following Smithy model:

$version: "2"

metadata validators = [
    {
        name: "EmitEachSelector"
        id: "ForbiddenDocumentation"
        message: "Documentation uses forbidden text"
        configuration: {
            selector: "[trait|documentation*='meow']"
        }
    }
]