Sketch Engine's Avatar

Sketch Engine

@sketchengine.eu

Sketch Engine is a linguistic search engine and corpus query system with text analysis tools and corpora in 100+ languages. Concordance, n-grams, term extraction, co-occurrences (Word Sketch) are only some of its features.

286
Followers
37
Following
80
Posts
21.11.2024
Joined
Posts Following

Latest posts by Sketch Engine @sketchengine.eu

Post image

The Telugu Web 2021 corpus, with 100+ million words and part-of-speech tagging, is now available in Sketch Engine! #corpuslinguistics, #digitalhumanities, #linguistics
www.sketchengine.eu/tetenten-tel...

05.03.2026 13:07 ๐Ÿ‘ 0 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Registration is open for Lexicom 2026 in Palermo ๐Ÿ‡ฎ๐Ÿ‡น! Since 2001, this workshop in #lexicography and #corpuslinguistics has welcomed 700+ participants worldwide. Join the community and take part in the next edition, 14โ€“18 September 2026.
๐Ÿ”— lexicom.courses/lexicom-2026...

27.02.2026 09:59 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Our new Chinese corpus in Traditional Chinese (็น้ซ”ๅญ—) is now available. It is part-of-speech tagged and partly annotated for topics and genres. A useful resource for research and language teaching. #corpuslinguistics #digitalhumanities
www.sketchengine.eu/zhtenten-chi...

23.02.2026 10:48 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Weโ€™ve published a new Chinese corpus in Simplified Chinese (็ฎ€ไฝ“ๅญ—). It is part-of-speech tagged and partly annotated for topics and genres. A useful resource for research and language technology. #corpuslinguistics #linguistics #nlp
www.sketchengine.eu/zhtenten-chi...

19.02.2026 13:12 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

An example of Sketch Engine used outside the field of pure linguistics. This study in media discourse analysis will be published in @nature.com www.nature.com/articles/s41... #MediaRepresentation #discourseanalysis #corpuslinguistics

11.02.2026 11:55 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Weโ€™ve published the Urdu Corpus 2021 in Sketch Engine, with 328 million words and topic and genre classification. Urdu is the 11th most spoken language worldwide (Ethnologue, 2025).
๐Ÿ”— www.sketchengine.eu/urtenten-urd...
#corpuslinguistics #TextAnalysis #ุงุฑุฏูˆ

29.01.2026 12:45 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

The new Latvian Corpus 2021 now available in Sketch Engine. The corpus is enriched with part-of-speech tagging and lemmatization. Perfect for #corpuslinguistics, #digitalhumanities, #linguistics, #lexicography, and #nlp.

22.01.2026 11:46 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

๐Ÿ“ข Registration is open for Lexicom 2026 in Palermo ๐Ÿ‡ฎ๐Ÿ‡น! Apply for this hands-on workshop on #lexicography, #corpuslinguistics, and #dictionaries. Learn from experts, explore new tools, and build your skills.
๐Ÿ“… 14โ€“18 September 2026
๐Ÿ”— lexicom.courses/lexicom-2026...

19.01.2026 10:45 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

You can search for multiple variants at the same time in the Word Sketch tool. Just add a comma between them โ€“ Christmas, Xmas โ€“ to see the results for both: ske.li/bav0

22.12.2025 11:04 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

The Word Sketch tool automatically separates senses and organizes collocations by their meaning, so you can analyze exactly what you want: ske.li/064
#collocations

18.12.2025 11:12 ๐Ÿ‘ 0 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Take the step to become a master in corpus analysis and corpus building with Sketch Engine. Choose from our online or face-to-face course options. Learn more at www.sketchengine.eu/bootcamp
#corpuslinguistics #TextAnalysis #appliedlinguistics

17.12.2025 12:26 ๐Ÿ‘ 2 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Find all n-grams containing the word โ€œChristmasโ€: ske.li/060
#ngrams

16.12.2025 08:55 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Parallel Concordance is an easy way to find translations of fixed expressions (idioms?) in multiple languages. Check out how others wish Merry Christmas at: ske.li/066

15.12.2025 11:06 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Lexicom 2026 is open for registration! Held in Palermo ๐Ÿ‡ฎ๐Ÿ‡น, 14โ€“18 September. Join 700+ graduates worldwide who have already attended this workshop in #lexicography, #corpuslinguistics, #dictionaries, and lexical computing.
๐Ÿ”— lexicom.courses/lexicom-2026...

12.12.2025 11:50 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Word Sketch can group collocations by meaning, so you can instantly tell which "bow" they belong to. The #collocations related to a decorative bow are highlighted in blue at ske.li/068
#wordsense

12.12.2025 11:30 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Curious how many words start with "snow"? Snowball, snowman, snowflake... Use the Wordlist tool to generate the full list from our billion-word corpora: ske.li/bam9

12.12.2025 09:19 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Among the 800+ corpora that we offer, you can also find many surprises โ€“ such as OpenSubtitles, corpora made up of translated movie subtitles. Can you guess the movie where Christmas is mentioned the most? ske.li/bam7
#corpuslinguistics #opensubtitles

10.12.2025 16:12 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

The Word Sketch tool organizes every collocation into clear grammatical categories, making it easy to navigate through the data: ske.li/06q

09.12.2025 14:18 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Compare all your favorite Christmas treats in one go. Use the "From this list" feature in the Wordlist tool to analyze the whole dessert table simultaneously: ske.li/banf
#wordlist

08.12.2025 09:47 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

With the English Trends corpus, you can compare the most prominent topics of each month. See what we typically talk about in December: ske.li/ban0
#trendingwords #trendingtopics

07.12.2025 19:09 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Translated into over 300 languages, Silent Night is one of the most famous carols. Check out its #translations to other languages in the parallel concordance: ske.li/065
#paralleltexts

06.12.2025 21:46 ๐Ÿ‘ 1 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

What sets Christmas apart from other holidays? Find unique collocations for each of them in the Word Sketch Difference tool: ske.li/babp

05.12.2025 22:32 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Word Sketches can also be generated for multi-word expressions. Find the strongest collocations for โ€œChristmas treeโ€ at ske.li/06v
#collocation
www.sketchengine.eu/guide/word-s...

04.12.2025 17:04 ๐Ÿ‘ 4 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

While we can't find two identical snowflakes, you can find words similar to "snowflake" when we look in a thesaurus: ske.li/06s
#thesaurus #similarwords

03.12.2025 12:28 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

No existing corpus that fits your niche research topic? Build your own corpus! With seed words, the corpus theme might be anything โ€“ even Christmas. www.sketchengine.eu/guide/create...
#textdata #textcorpus

02.12.2025 14:26 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1
Post image

Our Advent 2025 series starts today!๐ŸŽ‡
See how often snow appeared in past years with the Timeline tool in our Trends corpora: ske.li/06x

More small insights with Sketch Engine coming soon.
#corpuslinguistics #languagedata

01.12.2025 11:46 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Sketch Engine towel on the shore of Lake Bled, Slovenia.

Sketch Engine towel on the shore of Lake Bled, Slovenia.

Sketch Engine in โˆ’2 ยฐC air and +11 ยฐC water. Still working ๐Ÿ˜‰A good memory of our eLex 2025 days in Bled ๐Ÿ‡ธ๐Ÿ‡ฎ
Photo by Madis Jรผrviste, the Institute of the Estonian Language.

27.11.2025 12:06 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

It was a pleasure to meet friends, users and the wider lexicographic community. Thanks for visiting our booth.
elex.link/elex2025
#elex2025 #lexicography

20.11.2025 14:08 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

The Sketch Engine team is at the eLex conference in Bled, Slovenia!
Thanks to Michael Rundell for his fascinating talk on the changes in lexicography.
If you want to hear more from us, be sure to catch Ondล™ej Herman's talk on the development of monitor corpora today at 17:30!
elex.link/elex2025/

19.11.2025 11:53 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

A major milestone for our English Trends corpus: 100 billion tokens (โ‰ˆ 86 billion words) since 2014, with 70 million words added each week. Itโ€™s available with a free trial.
๐Ÿ”— www.sketchengine.eu/english-tren...
#corpuslinguistics #bigdata #language

13.11.2025 13:06 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0