tdebatty / java-string-similarity Star 2.7k Code Issues Pull requests Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ... java algorithm distance jaro-winkler levenshtein-distance similarity-measures cosine-similarity string-distance damerau-levenshtein shingles distance-measure Updated Jun 1, 2022 Java
ashvardanian / SimSIMD Star 765 Code Issues Pull requests Up to 200x Faster Inner Products and Vector Similarity ? for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE ?? information-retrieval metrics neon numpy assembly simd scipy blas avx2 similarity-measures distance-measures avx512 simd-instructions distance-calculation blas-libraries arm-neon similarity-search float16 vector-search arm-sve Updated Jun 12, 2024 C
hbollon / go-edlib Sponsor Star 458 Code Issues Pull requests Discussions ?? String comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc... go golang unicode algorithms edit-distance levenshtein jaro-winkler levenshtein-distance similarity-measures string-distance cosine string-matching damerau-levenshtein lcs lcs-distance hamming string-comparison golang-string-comparison edit-distance-algorithms Updated Jul 3, 2022 Go
feature23 / StringSimilarity.NET Star 434 Code Issues Pull requests A .NET port of java-string-similarity algorithms string dotnet distance strings jaro-winkler levenshtein-distance string-metrics similarity-measures cosine-similarity string-distance damerau-levenshtein shingles lcs-distance winkler Updated Jun 3, 2024 C#
patrickzib / SFA Star 308 Code Issues Pull requests Scalable Time Series Data Analytics time-series indexing classification similarity-measures Updated Mar 16, 2022 Java
WenRichard / Customer-Chatbot Star 307 Code Issues Pull requests 中文智能客服机器人demo,包含?聊和???答2?部分,支持自定??件(Chinese intelligent customer chatbot Demo, including the gossip and the professional Q&A(FAQ) , support for custom components!) nlp qa chatbot similarity faq similarity-measures customer-chatbot Updated Apr 9, 2022 Python
cjekel / similarity_measures Star 240 Code Issues Pull requests Quantify the difference between two arbitrary curves in space python dtw measure distance curve similarity-measures warping dynamic-time-warping frechet-distance fr-chet-distance Updated Nov 18, 2023 Jupyter Notebook
firmai / datagene Star 193 Code Issues Pull requests DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai ) encoding finance data-structures decomposition model-checking similarity-measures dataset-generation distance-measures synthesizers similarity-score testing-framework synthetic-data predictive-maintenance synthetic-dataset-generation distance-calculations dataset-similarity transformation-recipes data-transformations Updated Feb 8, 2022 Jupyter Notebook
matchms / matchms Star 169 Code Issues Pull requests Python library for processing (tandem) mass spectrometry data and for computing spectral similarities. analysis fuzzy-search fuzzy-matching python3 similarity-measures metabolomics mass-spectrometry Updated Jun 7, 2024 Python
ansegura7 / Algorithms Star 130 Code Issues Pull requests Discussions Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms. python computer-science statistics algorithms graphs cellular-automata mathematics fractal networkx similarity-measures dynamic-programming dijkstra-algorithm graph-coloring lasvegas-algorithm hanoi-towers divide-and-conquer chaotic-systems probabilistic-algorithms Updated Oct 18, 2023 HTML
drostlab / philentropy Star 128 Code Issues Pull requests Information Theory and Distance Quantification with R r statistics information-theory similarity-measures distance-measures jensen-shannon-divergence distance-quantification parametric-distributions Updated Feb 17, 2024 R
xgfs / verse Star 128 Code Issues Pull requests Reference implementation of the paper VERSE: Versatile Graph Embeddings from Similarity Measures machine-learning graph graph-algorithms machine-learning-algorithms embeddings similarity-measures Updated Feb 21, 2021 C++
firmai / mtss-gan Star 91 Code Issues Pull requests MTSS-GAN: Multivariate Time Series Simulation with Generative Adversarial Networks (by @firmai ) finance time-series simulation generative-adversarial-network stress-test similarity-measures multivariate-data model-validation synthetic-data multivariate-timeseries synthetic-dataset-generation adverserial Updated Sep 29, 2020
Nakilon / dhash-vips Star 88 Code Issues Pull requests vips-powered ruby gem to measure images similarity, implementing dHash and IDHash algorithms gem fingerprint fingerprints similarity-measures image-comparison perceptual-hashing similarity-search dhash Updated Apr 16, 2024 Ruby
chandan-u / graph-based-recommendation-system Star 63 Code Issues Pull requests building a recommendation system using graph search methodologies. We will be comparing these different approaches and closely observe the limitations of each. python data-science algorithms graph-algorithms pandas collaborative-filtering recommendation-system similarity-measures content-filtering Updated Feb 15, 2017 Python
jim-spyropoulos / Trajectory-Analysis-and-Classification-in-Python-Pandas-and-Scikit-Learn Star 62 Code Issues Pull requests Formed trajectories of sets of points.Experimented on finding similarities between trajectories based on DTW (Dynamic Time Warping) and LCSS (Longest Common SubSequence) algorithms.Modeled trajectories as strings based on a Grid representation.Benchmarked KNN, Random Forest, Logistic Regression classification algorithms to classify efficiently t… python machine-learning random-forest dtw scikit-learn classification logistic-regression similarity-measures trajectory-analysis knn trajectory scikitlearn-machine-learning classifiers Updated Feb 1, 2024 Python
frjnn / bhtsne Star 62 Code Issues Pull requests Parallel Barnes-Hut t-SNE implementation written in Rust. rust data-science machine-learning data-visualization dimensionality-reduction similarity-measures barnes-hut bhtsne Updated Jul 29, 2022 Rust
renjunxiang / chatbot_by_similarity Star 52 Code Issues Pull requests 根据文本相似度???答的聊天机器人(??版) nlp chatbot similarity-measures Updated Jul 19, 2018 Python
dumitrescustefan / RoWordNet Star 46 Code Issues Pull requests Romanian WordNet (Data + API for Python) python wordnet romanian similarity-measures rowordnet Updated Aug 6, 2020 Python
babylonhealth / fuzzymax Star 42 Code Issues Pull requests Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019. machine-learning natural-language-processing word-embeddings similarity-measures research-paper word-vectors Updated Jul 14, 2022 Python