Custom queries

与所有其他 Spring Data 模块一样，Spring Data Neo4j 允许你在存储库中指定自定义查询。如果你无法通过派生查询函数来表达查找器逻辑，那么这些会派上用场。

Spring Data Neo4j, like all the other Spring Data modules, allows you to specify custom queries in you repositories. Those come in handy if you cannot express the finder logic via derived query functions.

因为 Spring Data Neo4j 主要在底层工作上以记录为导向，所以记住这一点并不要为同一个“根节点”构建一个包含多个记录的结果集非常重要。

Because Spring Data Neo4j works heavily record-oriented under the hood, it is important to keep this in mind and not build up a result set with multiple records for the same "root node".

请在常见问题解答中了解如何使用来自存储库的自定义查询的替代形式，特别是如何将自定义查询与自定义映射结合使用：Custom queries and custom mappings。

Please have a look in the FAQ as well to learn about alternative forms of using custom queries from repositories, especially how to use custom queries with custom mappings: Custom queries and custom mappings.

Queries with relationships

Beware of the cartesian product

假设您有一个像 MATCH (m:Movie{title: 'The Matrix'})←[r:ACTED_IN]-(p:Person) return m,r,p 这样的查询，结果是这样的：

Assuming you have a query like MATCH (m:Movie{title: 'The Matrix'})←[r:ACTED_IN]-(p:Person) return m,r,p that results into something like this:

Multiple records (shortened)

+------------------------------------------------------------------------------------------+
| m        | r                                    | p                                      |
+------------------------------------------------------------------------------------------+
| (:Movie) | [:ACTED_IN {roles: ["Emil"]}]        | (:Person {name: "Emil Eifrem"})        |
| (:Movie) | [:ACTED_IN {roles: ["Agent Smith"]}] | (:Person {name: "Hugo Weaving})        |
| (:Movie) | [:ACTED_IN {roles: ["Morpheus"]}]    | (:Person {name: "Laurence Fishburne"}) |
| (:Movie) | [:ACTED_IN {roles: ["Trinity"]}]     | (:Person {name: "Carrie-Anne Moss"})   |
| (:Movie) | [:ACTED_IN {roles: ["Neo"]}]         | (:Person {name: "Keanu Reeves"})       |
+------------------------------------------------------------------------------------------+

映射的结果很可能无法使用。如果将其映射到列表中，它将包含 Movie 的重复项，但这部电影只会有一种关系。

The result from the mapping would be most likely unusable. If this would get mapped into a list, it will contain duplicates for the Movie but this movie will only have one relationship.

Getting one record per root node

要返回正确对象，需要在查询中 collect 关系和相关节点： MATCH (m:Movie{title: 'The Matrix'})←[r:ACTED_IN]-(p:Person) return m,collect(r),collect(p)

To get the right object(s) back, it is required to collect the relationships and related nodes in the query: MATCH (m:Movie{title: 'The Matrix'})←[r:ACTED_IN]-(p:Person) return m,collect(r),collect(p)

Single record (shortened)

+------------------------------------------------------------------------+
| m        | collect(r)                     | collect(p)                 |
+------------------------------------------------------------------------+
| (:Movie) | [[:ACTED_IN], [:ACTED_IN], ...]| [(:Person), (:Person),...] |
+------------------------------------------------------------------------+

通过这样的结果作为单条记录，Spring Data Neo4j 才有可能将所有相关节点正确添加到根节点。

With this result as a single record it is possible for Spring Data Neo4j to add all related nodes correctly to the root node.

Reaching deeper into the graph

上述示例假设你只尝试提取相关节点的第一级。这有时是不够的，图中可能还有更多的节点，这些节点也应该成为映射实例的一部分。有两种方法可以实现此目的：数据库端还原或客户端端还原。

The example above assumes that you are only trying to fetch the first level of related nodes. This is sometimes not enough and there are maybe nodes deeper in the graph that should also be part of the mapped instance. There are two ways to achieve this: Database-side or client-side reduction.

为此，上述示例还应该在 Movie 初始返回的 Persons 上包含 Movies。

For this the example from above should also contain Movies on the Persons that get returned with the initial Movie.

Figure 1. Example for 'The Matrix' and 'Keanu Reeves'

Database-side reduction

记住 Spring Data Neo4j 只能正确处理基于记录的，一个实体实例的结果需要在一份记录中。使用 Cypher’s path 功能是获取图中所有分支的有效选项。

Keeping in mind that Spring Data Neo4j can only properly process record based, the result for one entity instance needs to be in one record. Using Cypher’s path capabilities is a valid option to fetch all branches in the graph.

Naive path-based approach

MATCH p=(m:Movie{title: 'The Matrix'})<-[:ACTED_IN]-(:Person)-[:ACTED_IN*..0]->(:Movie)
RETURN p;

这会产生多个路径，这些路径不会在一条记录中合并。可以调用 collect(p)，但是 Spring Data Neo4j 不理解映射过程中路径的概念。因此，需要为结果提取节点和关系。

This will result in multiple paths that are not merged within one record. It is possible to call collect(p) but Spring Data Neo4j does not understand the concept of paths in the mapping process. Thus, nodes and relationships needs to get extracted for the result.

Extracting nodes and relationships

MATCH p=(m:Movie{title: 'The Matrix'})<-[:ACTED_IN]-(:Person)-[:ACTED_IN*..0]->(:Movie)
RETURN m, nodes(p), relationships(p);

由于有多条路径从《黑客帝国》通往其他电影，因此结果仍然不是单条记录。此时， Cypher’s reduce function 便发挥了作用。

Because there are multiple paths that lead from 'The Matrix' to another movie, the result still won’t be a single record. This is where Cypher’s reduce function comes into play.

Reducing nodes and relationships

MATCH p=(m:Movie{title: 'The Matrix'})<-[:ACTED_IN]-(:Person)-[:ACTED_IN*..0]->(:Movie)
WITH collect(p) as paths, m
WITH m,
reduce(a=[], node in reduce(b=[], c in [aa in paths | nodes(aa)] | b + c) | case when node in a then a else a + node end) as nodes,
reduce(d=[], relationship in reduce(e=[], f in [dd in paths | relationships(dd)] | e + f) | case when relationship in d then d else d + relationship end) as relationships
RETURN m, relationships, nodes;

reduce 函数允许我们展平不同路径中的节点和关系。结果，我们将获得类似于 Getting one record per root node 的元组，但集合中混合了关系类型或节点。

The reduce function allows us to flatten the nodes and relationships from various paths. As a result we will get a tuple similar to Getting one record per root node but with a mixture of relationship types or nodes in the collections.

Client-side reduction

如果应在客户端发生规约，Spring Data Neo4j 可以让你还能映射关系或节点的列表列表。尽管如此，仍然要求返回的记录应包含所有信息，以便正确填充结果实体实例。

If the reduction should happen on the client-side, Spring Data Neo4j enables you to map also lists of lists of relationships or nodes. Still, the requirement applies that the returned record should contain all information to hydrate the resulting entity instance correctly.

Collect nodes and relationships from path

MATCH p=(m:Movie{title: 'The Matrix'})<-[:ACTED_IN]-(:Person)-[:ACTED_IN*..0]->(:Movie)
RETURN m, collect(nodes(p)), collect(relationships(p));

额外的 collect 语句按以下格式创建列表：

The additional collect statement creates lists in the format:

[[rel1, rel2], [rel3, rel4]]

这些列表现在将在映射过程中转换为平面列表。

Those lists will now get converted during the mapping process into a flat list.

根据要生成的数据量来决定是否选择客户端或数据库端缩减。在使用 reduce 函数时，首先需要在数据库内存中创建所有路径。另一方面，需要在客户器端进行合并的大量数据可能会导致其内存使用量急剧提高。

Deciding if you want to go with client-side or database-side reduction depends on the amount of data that will get generated. All the paths needs to get created in the database’s memory first when the reduce function is used. On the other hand a large amount of data that needs to get merged on the client-side results in a higher memory usage there.

Using paths to populate and return a list of entities

给定一个如下所示的图形：

Given are a graph that looks like this:

Figure 2. graph with outgoing relationships

以及一个领域模型，如 custom-query.paths.dm 所示（为了简洁，省略了构造函数和访问器）：

and a domain model as shown in the custom-query.paths.dm (Constructors and accessors have been omitted for brevity):

Domain model for a graph with outgoing relationships.

Unresolved include directive in modules/ROOT/pages/appendix/custom-queries.adoc - include::example$integration/issues/gh2210/SomeEntity.java[]

Unresolved include directive in modules/ROOT/pages/appendix/custom-queries.adoc - include::example$integration/issues/gh2210/SomeRelation.java[]

如你所见，关系只是传出的。生成的查找器方法（包括 findById）将始终尝试匹配要映射的根节点。从那里开始，将映射所有相关对象。在应仅返回一个对象的查询中，该根对象将被返回。在返回多个对象的查询中，将返回所有匹配的对象。当然也会填充从这些返回的对象出发的和入发的关系。

As you see, the relationships are only outgoing. Generated finder methods (including findById) will always try to match a root node to be mapped. From there on onwards, all related objects will be mapped. In queries that should return only one object, that root object is returned. In queries that return many objects, all matching objects are returned. Out- and incoming relationships from those objects returned are of course populated.

假设以下 Cypher 查询：

Assume the following Cypher query:

MATCH p = (leaf:SomeEntity {number: $a})-[:SOME_RELATION_TO*]-(:SomeEntity)
RETURN leaf, collect(nodes(p)), collect(relationships(p))

它遵循 Getting one record per root node 的建议，并且对你要在这里匹配的叶子节点非常有用。但是：这仅适用于所有返回 0 个或 1 个映射对象的场景。虽然该查询会像以前一样填充所有关系，但它不会返回全部 4 个对象。

It follows the recommendation from Getting one record per root node and it works great for the leaf node you want to match here. However: That is only the case in all scenarios that return 0 or 1 mapped objects. While that query will populate all relationships like before, it won’t return all 4 objects.

这可以通过返回整个路径来更改：

This can be changed by returning the whole path:

MATCH p = (leaf:SomeEntity {number: $a})-[:SOME_RELATION_TO*]-(:SomeEntity)
RETURN p

在这里，我们确实想要利用这样一个事实，即路径 p 实际上返回 3 行，其中包含指向全部 4 个节点的路径。全部 4 个节点将被填充、链接在一起并返回。

Here we do want to use the fact that the path p actually returns 3 rows with paths to all 4 nodes. All 4 nodes will be populated, linked together and returned.

Parameters in custom queries

你执行此操作的方式与在 Neo4j 浏览器或 Cypher-Shell 中发出的标准 Cypher 查询完全相同，语法为 $（从 Neo4j 4.0 开始，数据库中已删除 Cypher 参数的旧 ${foo} 语法）。

You do this exactly the same way as in a standard Cypher query issued in the Neo4j Browser or the Cypher-Shell, with the $ syntax (from Neo4j 4.0 on upwards, the old ${foo} syntax for Cypher parameters has been removed from the database).

ARepository.java

Unresolved include directive in modules/ROOT/pages/appendix/custom-queries.adoc - include::example$documentation/repositories/domain_events/ARepository.java[]

1	在此，我们按名称引用参数。您还可使用 `$0` 等。
2	Here we are referring to the parameter by its name. You can also use `$0` etc. instead.

您需要使用 -parameters 编译您的 Java 8+ 项目，以便在没有更多注释的情况下使命名参数生效。Spring Boot Maven 和 Gradle 插件会自动为您执行此操作。如果由于某种原因此操作不可行，您可以添加 @Param 并显式指定名称，或使用参数索引。

You need to compile your Java 8+ project with -parameters to make named parameters work without further annotations. The Spring Boot Maven and Gradle plugins do this automatically for you. If this is not feasible for any reason, you can either add @Param and specify the name explicitly or use the parameters index.

作为参数传递给带有自定义查询注释的函数的映射实体（带 @Node 的所有内容）将变为嵌套映射。以下示例表示 Neo4j 参数的结构。

Mapped entities (everything with a @Node) passed as parameter to a function that is annotated with a custom query will be turned into a nested map. The following example represents the structure as Neo4j parameters.

给定一个如 movie-model 所示使用注释的 Movie、Vertex 和 Actor 类：

Given are a Movie, Vertex and Actor classes annotated as shown in movie-model:

"Standard" movies model

@Node
public final class Movie {

    @Id
    private final String title;

    @Property("tagline")
    private final String description;

    @Relationship(value = "ACTED_IN", direction = Direction.INCOMING)
    private final List<Actor> actors;

    @Relationship(value = "DIRECTED", direction = Direction.INCOMING)
    private final List<Person> directors;
}

@Node
public final class Person {

    @Id @GeneratedValue
    private final Long id;

    private final String name;

    private Integer born;

    @Relationship("REVIEWED")
    private List<Movie> reviewed = new ArrayList<>();
}

@RelationshipProperties
public final class Actor {

	@RelationshipId
	private final Long id;

    @TargetNode
    private final Person person;

    private final List<String> roles;
}

interface MovieRepository extends Neo4jRepository<Movie, String> {

    @Query("MATCH (m:Movie {title: $movie.__id__})\n"
           + "MATCH (m) <- [r:DIRECTED|REVIEWED|ACTED_IN] - (p:Person)\n"
           + "return m, collect(r), collect(p)")
    Movie findByMovie(@Param("movie") Movie movie);
}

将 Movie 的实例传递给上面的存储库方法将生成以下 Neo4j 映射参数：

Passing an instance of Movie to the repository method above, will generate the following Neo4j map parameter:

{
  "movie": {
    "__labels__": [
      "Movie"
    ],
    "__id__": "The Da Vinci Code",
    "__properties__": {
      "ACTED_IN": [
        {
          "__properties__": {
            "roles": [
              "Sophie Neveu"
            ]
          },
          "__target__": {
            "__labels__": [
              "Person"
            ],
            "__id__": 402,
            "__properties__": {
              "name": "Audrey Tautou",
              "born": 1976
            }
          }
        },
        {
          "__properties__": {
            "roles": [
              "Sir Leight Teabing"
            ]
          },
          "__target__": {
            "__labels__": [
              "Person"
            ],
            "__id__": 401,
            "__properties__": {
              "name": "Ian McKellen",
              "born": 1939
            }
          }
        },
        {
          "__properties__": {
            "roles": [
              "Dr. Robert Langdon"
            ]
          },
          "__target__": {
            "__labels__": [
              "Person"
            ],
            "__id__": 360,
            "__properties__": {
              "name": "Tom Hanks",
              "born": 1956
            }
          }
        },
        {
          "__properties__": {
            "roles": [
              "Silas"
            ]
          },
          "__target__": {
            "__labels__": [
              "Person"
            ],
            "__id__": 403,
            "__properties__": {
              "name": "Paul Bettany",
              "born": 1971
            }
          }
        }
      ],
      "DIRECTED": [
        {
          "__labels__": [
            "Person"
          ],
          "__id__": 404,
          "__properties__": {
            "name": "Ron Howard",
            "born": 1954
          }
        }
      ],
      "tagline": "Break The Codes",
      "released": 2006
    }
  }
}

一个节点由一个映射表示。该映射将始终包含 id，它是映射的 ID 属性。在 labels 下将可以使用所有标签（静态和动态）。所有属性以及关系类型都将出现在这些映射中，就像 SDN 编写实体时它们在图形中出现一样。值将具有正确的 Cypher 类型，无需进一步转换。

A node is represented by a map. The map will always contain id which is the mapped id property. Under labels all labels, static and dynamic, will be available. All properties - and type of relationships - appear in those maps as they would appear in the graph when the entity would have been written by SDN. Values will have the correct Cypher type and won’t need further conversion.

所有关联关系都是地图列表。动态关系将相应地得到解决。一对一关系也将序列化为单例列表。因此，要访问人与人之间的单一对一映射，应该编写此 $person.properties.BEST_FRIEND[0].target.id。

All relationships are lists of maps. Dynamic relationships will be resolved accordingly. One-to-one relationships will also be serialized as singleton lists. So to access a one-to-one mapping between people, you would write this das $person.properties.BEST_FRIEND[0].target.id.

如果一个实体与不同类型的其他节点有相同类型的关系，它们都会出现在同一个列表中。如果你需要这样的映射，并且还需要处理这些自定义参数，则必须相应地展开它。执行此操作的一种方法是使用关联子查询（需要 Neo4j 4.1+）。

If an entity has a relationship with the same type to different types of others nodes, they will all appear in the same list. If you need such a mapping and also have the need to work with those custom parameters, you have to unroll it accordingly. One way to do this are correlated subqueries (Neo4j 4.1+ required).

Spring Expression Language in custom queries

Spring 表达式语言 (SpEL) 可用于 :#{} 中的自定义查询。这里的冒号指的是一个参数，此类表达式应使用在参数有意义的地方。但是，当使用我们的 literal extension 时，您可以在标准 Cypher 不允许参数（例如对于标签或关系类型）的地方使用 SpEL 表达式。这是定义要进行 SpEL 评估的查询中文本块的标准 Spring Data 方式。

Spring Expression Language (SpEL) can be used in custom queries inside :#{}. The colon here refers to a parameter and such an expression should be used where parameters make sense. However, when using our literal-extension you can use SpEL expression in places where standard Cypher won’t allow parameters (such as for labels or relationship types). This is the standard Spring Data way of defining a block of text inside a query that undergoes SpEL evaluation.

以下示例基本上定义了与上述内容相同的一个查询，但它使用 WHERE 子句来避免使用更多的大括号：

The following example basically defines the same query as above, but uses a WHERE clause to avoid even more curly braces:

ARepository.java

Unresolved include directive in modules/ROOT/pages/appendix/custom-queries.adoc - include::example$documentation/repositories/domain_events/ARepository.java[]

SpEL 块以 :#{`开头，然后按名称引用指定的 `String`参数 (#pt1`)。不要将这与上述 Cypher 语法搞混！SpEL 表达式将两个参数串联成一个最终传递给 appendix/neo4j-client.adoc#neo4j-client的单个值。SpEL 块以 `}`结尾。

The SpEL blocked starts with :#{ and then refers to the given String parameters by name (#pt1). Don’t confuse this with the above Cypher syntax! The SpEL expression concatenates both parameters into one single value that is eventually passed on to the appendix/neo4j-client.adoc#neo4j-client. The SpEL block ends with }.

SpEL 还解决了另外两个问题。我们提供了两个扩展，可将 `Sort`对象传递到自定义查询中。还记得 custom queries中的 faq.adoc#custom-queries-with-page-and-slice-examples吗？通过 `orderBy`扩展，您可以使用动态排序将 `Pageable`传递到自定义查询中：

SpEL also solves two additional problems. We provide two extensions that allow to pass in a Sort object into custom queries. Remember faq.adoc#custom-queries-with-page-and-slice-examples from custom queries? With the orderBy extension you can pass in a Pageable with a dynamic sort to a custom query:

orderBy-Extension

import org.springframework.data.domain.Pageable;
import org.springframework.data.domain.Sort;
import org.springframework.data.neo4j.repository.Neo4jRepository;
import org.springframework.data.neo4j.repository.query.Query;

public interface MyPersonRepository extends Neo4jRepository<Person, Long> {

    @Query(""
        + "MATCH (n:Person) WHERE n.name = $name RETURN n "
        + ":#{orderBy(#pageable)} SKIP $skip LIMIT $limit" (1)
    )
    Slice<Person> findSliceByName(String name, Pageable pageable);

    @Query(""
        + "MATCH (n:Person) WHERE n.name = $name RETURN n :#{orderBy(#sort)}" (2)
    )
    List<Person> findAllByName(String name, Sort sort);
}

1	在 SpEL 上下文中，`Pageable` 始终名为 `pageable`。
2	A `Pageable` has always the name `pageable` inside the SpEL context.
3	在 SpEL 上下文中，`Sort` 始终名为 `sort`。
4	A `Sort` has always the name `sort` inside the SpEL context.

Spring Expression Language extensions

Literal extension

literal 扩展可以用来在自定义查询中使标签或关系类型“动态化”。在 Cypher 中既不能对标签也不能对关系类型进行参数化，因此必须将它们指定为文字。

The literal extension can be used to make things like labels or relationship-types "dynamic" in custom queries. Neither labels nor relationship types can be parameterized in Cypher, so they must be given literal.

literal-Extension

interface BaseClassRepository extends Neo4jRepository<Inheritance.BaseClass, Long> {

    @Query("MATCH (n:`:#{literal(#label)}`) RETURN n") (1)
    List<Inheritance.BaseClass> findByLabel(String label);
}

1	`literal` 扩展将替换为已评估参数的字面值。
2	The `literal` extension will be replaced with the literal value of the evaluated parameter.

在此处，literal 值已被用于根据标签动态匹配。如果您将 SomeLabel 作为参数传递到该方法，则会生成 MATCH (n:`SomeLabel) RETURN n`。已添加单引号来对值进行正确的转义。SDN 不会为您执行此操作，因为这可能不是您始终想要的结果。

Here, the literal value has been used to match dynamically on a Label. If you pass in SomeLabel as a parameter to the method, MATCH (n:`SomeLabel) RETURN n` will be generated. Ticks have been added to correctly escape values. SDN won’t do this for you as this is probably not what you want in all cases.

List extensions

对于多个值，allOf 和 anyOf 可以即时呈现所有值的 & 或 | 连接的列表。

For more than one value there are allOf and anyOf in place that would render either a & or | concatenated list of all values.

List extensions

interface BaseClassRepository extends Neo4jRepository<Inheritance.BaseClass, Long> {

    @Query("MATCH (n:`:#{allOf(#label)}`) RETURN n")
    List<Inheritance.BaseClass> findByLabels(List<String> labels);

    @Query("MATCH (n:`:#{anyOf(#label)}`) RETURN n")
    List<Inheritance.BaseClass> findByLabels(List<String> labels);
}

Referring to Labels

您已经知道如何将节点映射到一个域对象：

You already know how to map a Node to a domain object:

A Node with many labels

@Node(primaryLabel = "Bike", labels = {"Gravel", "Easy Trail"})
public class BikeNode {
    @Id String id;

    String name;
}

此节点带有几个标签，并且在自定义查询中始终重复所有标签非常容易出错：您可能会忘记一个或输入一个错误。我们提供了以下表达式来缓解此问题：#{#staticLabels}。请注意这一选项_不_以冒号开头！它用于具有 @Query 注释的存储库方法：

This node has a couple of labels, and it would be rather error prone to repeat them all the time in custom queries: You might forget one or make a typo. We offer the following expression to mitigate this: #{#staticLabels}. Notice that this one does not start with a colon! You use it on repository methods annotated with @Query:

#{#staticLabels} in action

public interface BikeRepository extends Neo4jRepository<Bike, String> {

    @Query("MATCH (n:#{#staticLabels}) WHERE n.id = $nameOrId OR n.name = $nameOrId RETURN n")
    Optional<Bike> findByNameOrId(@Param("nameOrId") String nameOrId);
}

此查询将解析为

This query will resolve to

MATCH (n:`Bike`:`Gravel`:`Easy Trail`) WHERE n.id = $nameOrId OR n.name = $nameOrId RETURN n

请注意我们如何为 nameOrId 使用标准参数：在大多数情况下，无需通过添加 SpEL 表达式来增加这里的复杂性。

Notice how we used standard parameter for the nameOrId: In most cases there is no need to complicate things here by adding a SpEL expression.