532:
278:
Gensim library has been used and cited in over 1400 commercial and academic applications as of 2018, in a diverse array of disciplines from medicine to insurance claim analysis to patent search. The software has been covered in several new articles, podcasts and interviews.
226:
for performance. Gensim is designed to handle large text collections using data streaming and incremental online algorithms, which differentiates it from most other machine learning software packages that target only in-memory processing.
302:
Gensim is commercially supported by the company rare-technologies.com, who also provide student mentorships and academic thesis projects for Gensim via their
Student Incubator programme.
407:
602:
573:
597:
566:
248:
166:
340:
607:
34:
592:
559:
452:
219:
122:
99:
252:
208:
73:
440:
260:
244:
351:
204:
154:
196:
429:
149:
319:
266:
Some of the novel online algorithms in Gensim were also published in the 2011 PhD dissertation
138:
543:
212:
161:
129:
418:
20:
586:
292:
200:
531:
465:
105:
51:
41:
487:
498:
377:
363:
539:
476:
240:
236:
33:
296:
288:
256:
223:
515:
177:
142:
134:
378:"Scalability of Semantic Analysis in Natural Language Processing"
268:
Scalability of
Semantic Analysis in Natural Language Processing
366:. Proc. LREC Workshop on New Challenges for NLP Frameworks
364:
Software framework for topic modelling with large corpora
235:
Gensim includes streamed parallelized implementations of
547:
453:"DecisionStats Interview Radim Řehůřek Gensim #python"
172:
160:
148:
128:
118:
98:
72:
60:
50:
40:
287:The open source code is developed and hosted on
16:Vector space modeling and topic modeling toolkit
441:Interview with Radim Řehůřek, creator of Gensim
567:
396:software package that accompanies this thesis
8:
291:and a public support forum is maintained on
26:
574:
560:
211:functionalities, using modern statistical
32:
25:
270:of Radim Řehůřek, the creator of Gensim.
603:Python (programming language) libraries
311:
430:Podcast.__init__ episode #71 on Gensim
352:Deep learning with word2vec and Gensim
362:Radim Řehůřek and Petr Sojka (2010).
207:, retrieval by similarity, and other
7:
598:Natural language processing toolkits
528:
526:
477:Gensim mailing list on Google Groups
243:and doc2vec algorithms, as well as
546:. You can help Knowledge (XXG) by
14:
249:non-negative matrix factorization
530:
1:
419:Commercial adopters of Gensim
499:Gensim open source Incubator
466:Gensim source code on Github
283:Free and Commercial Support
253:latent Dirichlet allocation
209:natural language processing
624:
525:
488:Gensim chat room on Gitter
18:
408:Gensim academic citations
218:Gensim is implemented in
199:library for unsupervised
94:
79:4.3.2 / 24 August 2023
68:
31:
245:latent semantic analysis
19:Not to be confused with
376:Řehůřek, Radim (2011).
341:Scalable *2vec training
81:; 12 months ago
608:Science software stubs
56:RARE Technologies Ltd.
593:Free science software
155:Information retrieval
540:scientific software
28:
455:. 8 December 2015.
261:random projections
110:/RaRe-Technologies
42:Original author(s)
555:
554:
247:(LSA, LSI, SVD),
205:document indexing
190:
189:
615:
576:
569:
562:
534:
527:
519:
518:
516:Official website
501:
496:
490:
485:
479:
474:
468:
463:
457:
456:
449:
443:
438:
432:
427:
421:
416:
410:
405:
399:
398:
389:
387:
382:
373:
367:
360:
354:
349:
343:
338:
332:
331:
329:
327:
322:. 24 August 2023
316:
213:machine learning
186:
183:
181:
179:
130:Operating system
114:
111:
109:
107:
89:
87:
82:
36:
29:
623:
622:
618:
617:
616:
614:
613:
612:
583:
582:
581:
580:
523:
514:
513:
510:
505:
504:
497:
493:
486:
482:
475:
471:
464:
460:
451:
450:
446:
439:
435:
428:
424:
417:
413:
406:
402:
392:my open-source
385:
383:
380:
375:
374:
370:
361:
357:
350:
346:
339:
335:
325:
323:
320:"Release 4.3.2"
318:
317:
313:
308:
285:
276:
233:
176:
104:
90:
85:
83:
80:
61:Initial release
24:
17:
12:
11:
5:
621:
619:
611:
610:
605:
600:
595:
585:
584:
579:
578:
571:
564:
556:
553:
552:
535:
521:
520:
509:
508:External links
506:
503:
502:
491:
480:
469:
458:
444:
433:
422:
411:
400:
368:
355:
344:
333:
310:
309:
307:
304:
284:
281:
275:
274:Uses of Gensim
272:
232:
229:
201:topic modeling
188:
187:
174:
170:
169:
164:
158:
157:
152:
146:
145:
132:
126:
125:
120:
116:
115:
102:
96:
95:
92:
91:
86:24 August 2023
78:
76:
74:Stable release
70:
69:
66:
65:
62:
58:
57:
54:
48:
47:
44:
38:
37:
21:Genshin Impact
15:
13:
10:
9:
6:
4:
3:
2:
620:
609:
606:
604:
601:
599:
596:
594:
591:
590:
588:
577:
572:
570:
565:
563:
558:
557:
551:
549:
545:
542:article is a
541:
536:
533:
529:
524:
517:
512:
511:
507:
500:
495:
492:
489:
484:
481:
478:
473:
470:
467:
462:
459:
454:
448:
445:
442:
437:
434:
431:
426:
423:
420:
415:
412:
409:
404:
401:
397:
395:
379:
372:
369:
365:
359:
356:
353:
348:
345:
342:
337:
334:
321:
315:
312:
305:
303:
300:
298:
294:
293:Google Groups
290:
282:
280:
273:
271:
269:
264:
262:
258:
254:
250:
246:
242:
238:
231:Main Features
230:
228:
225:
221:
216:
214:
210:
206:
202:
198:
194:
185:
175:
171:
168:
165:
163:
159:
156:
153:
151:
147:
144:
140:
136:
133:
131:
127:
124:
121:
117:
113:
103:
101:
97:
93:
77:
75:
71:
67:
63:
59:
55:
53:
49:
46:Radim Řehůřek
45:
43:
39:
35:
30:
22:
548:expanding it
537:
522:
494:
483:
472:
461:
447:
436:
425:
414:
403:
393:
391:
384:. Retrieved
371:
358:
347:
336:
326:18 September
324:. Retrieved
314:
301:
286:
277:
267:
265:
234:
217:
192:
191:
178:radimrehurek
52:Developer(s)
197:open-source
587:Categories
386:27 January
306:References
119:Written in
100:Repository
241:word2vec
237:fastText
255:(LDA),
251:(NMF),
182:/gensim
173:Website
162:License
139:Windows
112:/gensim
84: (
394:gensim
297:Gitter
289:GitHub
257:tf-idf
224:Cython
220:Python
195:is an
193:Gensim
123:Python
106:github
27:Gensim
538:This
381:(PDF)
143:macOS
135:Linux
544:stub
388:2015
328:2023
295:and
259:and
222:and
180:.com
167:LGPL
150:Type
108:.com
64:2009
589::
390:.
299:.
263:.
239:,
215:.
203:,
141:,
137:,
575:e
568:t
561:v
550:.
330:.
184:/
88:)
23:.
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.