docs/start/modelling/fulltext.md
+8 -8 (8 additions & 8 deletions)
@@ -2,7 +2,7 @@
CrateDB features **native full‑text search** powered by **Apache Lucene** and Okapi BM25 ranking, fully accessible via SQL. You can blend this seamlessly with other data types—JSON, time‑series, geospatial, vectors and more—all in a single SQL query platform.

-## 1. Data Types & Indexing Strategy
+## Data Types & Indexing Strategy
* By default, all text columns are indexed as `plain` (raw, unanalyzed)—efficient for equality search but not suitable for full‑text queries
* To enable full‑text search, you must define a **FULLTEXT index** with an optional language **analyzer**, e.g.:
@@ -21,7 +21,7 @@ CREATE TABLE documents (
INDEX ft_all USING FULLTEXT(title, body) WITH (analyzer ='english');
CrateDB uses the SQL `MATCH` predicate to run full‑text queries against full‑text indices. It optionally returns a relevance score `_score`, ranked via BM25.
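
As a minimal sketch of the `MATCH` predicate described above, the query below searches the `ft_all` index on the `documents` table and orders hits by `_score`. The `id` column and the search term are illustrative assumptions and do not come from the original file.

```sql
-- Sketch only: assumes the documents table and the ft_all index defined above.
-- The id column and the search term are hypothetical.
SELECT id, title, _score
FROM documents
WHERE MATCH(ft_all, 'distributed database')
ORDER BY _score DESC
LIMIT 10;
```

Selecting `_score` is optional. It is included here so the ranking can be inspected next to each hit.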
@@ -100,7 +100,7 @@ WHERE MATCH((ft_en, ft_de), 'jupm OR verwrlost') USING best_fields WITH (fuzzine
ORDER BY _score DESC;
```

-## 4. Use Cases & Integration
+## Use Cases & Integration
CrateDB is ideal for searching **semi-structured large text data**—product catalogs, article archives, user-generated content, descriptions and logs.
@@ -119,13 +119,13 @@ WHERE
This blend lets you query by text relevance, numeric filters, and spatial constraints, all in one.

-## 5. Architectural Strengths
+## Architectural Strengths

* **Built on Lucene inverted index + BM25**, offering relevance ranking comparable to search engines.
* **Scale horizontally across clusters**, while maintaining fast indexing and search even on high volume datasets.
* **Integrated SQL interface**: eliminates need for separate search services like Elasticsearch or Solr.
* **Hands‑On Academy Course**: explore FTS on real datasets (e.g. Chicago neighborhoods).
* **CrateDB Community Insights**: real‑world advice and experiences from users.

-## **8. Summary**
+## **Summary**
CrateDB combines powerful Lucene‑based full‑text search capabilities with SQL, making it easy to model and query textual data at scale. It supports fuzzy matching, multi-language analysis, composite indexing, and integrates fully with other data types for rich, multi-model queries. Whether you're building document search, catalog lookup, or content analytics—CrateDB offers a flexible and scalable foundation.\