These are public posts tagged with #preprocessing. You can interact with them if you have an account anywhere in the fediverse.
Pipeline release! nf-core/sarek v3.5.1 - 3.5.1 - Akkatjåkkå!
Please see the changelog: https://github.com/nf-core/sarek/releases/tag/3.5.1
#annotation #cancer #gatk4 #genomics #germline #preprocessing #somatic #targetpanels #variantcalling #wholeexomesequencing #wholegenomesequencing #nfcore #openscience #nextflow #bioinformatics
What's Changed Changed Bump schema version to 2.2.1…
GitHubPipeline release! nf-core/sarek v3.5.0 - 3.5.0 - Áhkájiegna!
Please see the changelog: https://github.com/nf-core/sarek/releases/tag/3.5.0
#annotation #cancer #gatk4 #genomics #germline #preprocessing #somatic #targetpanels #variantcalling #wholeexomesequencing #wholegenomesequencing #nfcore #openscience #nextflow #bioinformatics
What's Changed Added Tool: Lofreq callparallel by…
GitHubThis morning I finished another post on my tiny blog, this time about how I set up automatic image pre-processing in @eleventy to maintain a perfect Lighthouse score while allowing myself to be lazy about images: https://www.martingunnarsson.com/posts/eleventy-automatic-image-pre-processing/
#eleventy #11ty #web #webdev #webdevelopment #image #images #processing #preprocessing #performance #webperformance #lighthouse
A brief description of how I set up automatic image…
www.martingunnarsson.comA new benchmark for data
Rather than test if a model is good
This tests whether you can filter data
360 languages
They also share metrics for data redundancy if you want just those
https://arxiv.org/abs/2311.06440
https://github.com/toizzy/
#data #preprocessing #dedup #enough2skim #NLP #NLProc
So this is the #inofficial #opening of #ICFCA2023 with a talk by Johannes Hirth on #preprocessing and #scaling contextual data.
Extremely noticeable #KNOWLEDGE GAPS of ChatGPT in the #history of #Holocaust-related art claims make it clearer than ever the urgency of understanding the data #pipelines that feed the #AI language model.
What #filters are used in #OpenAI's data #preprocessing to EXCLUDE information? Who decides which information to exclude? What triggers exclusion?
#ChatGPT fills gaps with plausible -sounding disinformation - which is a disaster
Text #Preprocessing in #Python: Steps, Tools, and Examples | #nlp https://medium.com/@datamonsters/text-preprocessing-in-python-steps-tools-and-examples-bf025f872908
by Olga Davydova, Data Monsters
medium.com