Silvia OnofreiinTowards Data ScienceRelation Extraction with Llama3 ModelsEnhanced relation extraction by fine-tuning Llama3–8B with a synthetic dataset created using Llama3–70BApr 264Apr 264
Silvia OnofreiinTowards Data ScienceCypher Generation: The Good, The Bad and The MessyMethods for creating fine-tuning datasets for text-to-Cypher generation.Jan 29Jan 29
Silvia OnofreiinTowards Data ScienceLeverage KeyBERT, HDBSCAN and Zephyr-7B-Beta to Build a Knowledge GraphLLM-enhanced natural language processing and traditional machine learning techniques are used to extract structure and to build a knowledge…Jan 78Jan 78
Silvia OnofreiinTowards Data ScienceTransforming text into vectors: TSDAE’s unsupervised approach to enhanced embeddingsCombine TSDAE pre-training on a target domain with supervised fine-tuning on a general-purpose corpus to enhance the quality of the…Oct 16, 2023Oct 16, 2023
Silvia OnofreiCode Llama’s “Knowledge” of Neo4j’s Cypher Query LanguageA simple experiment into how does Code Llama with Neo4j’s query language CyperAug 28, 20232Aug 28, 20232
Silvia OnofreiTopic Modeling with Healthcare Spark NLPHow to leverage Healthcare Spark NLP pretrained models to categorize a small collection of publications on equine colicMay 26, 2022May 26, 2022
Silvia OnofreiDid Stacking Improve My PySpark Churn Prediction Model?Stacking with PySpark to predict customer churn for a fictional music platform Sparkify.Feb 17, 2022Feb 17, 2022
Silvia OnofreiUser Activity Based Churn Prediction With PySpark on an AWS-EMR ClusterAnalyze and predict customer churn for a fictional music platform Sparkify.Feb 17, 20221Feb 17, 20221
Silvia OnofreiWeb Scraping Mini ProjectExtract Text From Udacity Course Catalog Website Using Beautiful Soup and SeleniumSep 21, 2021Sep 21, 2021
Silvia OnofreiWho Are the Data Professionals?Three way analysis of the StackOverflow Developers Annual Survey.Aug 18, 20211Aug 18, 20211