add parse_audience to all taxila scrapers #1229
Merged
fbacall merged 5 commits into ElixirTeSS:master from DaanVanVugt:update/taxila_scrapers_target_audience (Mar 5, 2026)
Commits
8451134 add parse_audience to all taxila scrapers (mikesndrs)
4c42590 integrate auto parser into add_event, couple to config var, make avai… (mikesndrs)
c10c850 remove last instances of parse_audience (mikesndrs)
71e8efa add test for auto parsing (mikesndrs)
02f7173 copilot comments (mikesndrs)
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,140 @@ | ||
| { | ||
| "Research data": "RDM, FAIR principles and open science", | ||
| "Scientific disciplines": "RDM, FAIR principles and open science", | ||
| "Research methodologies": "RDM, FAIR principles and open science", | ||
| "Research data lifecycle": "RDM, FAIR principles and open science", | ||
| "Roles and responsibilities in RDM": "RDM, FAIR principles and open science", | ||
| "Open science": "RDM, FAIR principles and open science", | ||
| "Reproducibility and replicability": "RDM, FAIR principles and open science", | ||
| "Scholarly communication": "RDM, FAIR principles and open science", | ||
| "Costs for RDM": "RDM, FAIR principles and open science", | ||
| "Data management planning": "RDM, FAIR principles and open science", | ||
| "FAIR data principles": "RDM, FAIR principles and open science", | ||
| "Data discovery": "RDM, FAIR principles and open science", | ||
| "Data reuse": "RDM, FAIR principles and open science", | ||
| "Data collection": "RDM, FAIR principles and open science", | ||
| "Data documentation": "RDM, FAIR principles and open science", | ||
| "Data organisation": "RDM, FAIR principles and open science", | ||
| "File naming (conventions)": "RDM, FAIR principles and open science", | ||
| "Data versioning": "RDM, FAIR principles and open science", | ||
| "Data formats and types": "RDM, FAIR principles and open science", | ||
| "Data back-up": "RDM, FAIR principles and open science", | ||
| "Data selection": "RDM, FAIR principles and open science", | ||
| "Data destruction": "RDM, FAIR principles and open science", | ||
| "Data preservation and archiving": "RDM, FAIR principles and open science", | ||
| "Data publication": "RDM, FAIR principles and open science", | ||
| "Data curation": "RDM, FAIR principles and open science", | ||
| "Data visualisation": "RDM, FAIR principles and open science", | ||
| "Data provenance": "RDM, FAIR principles and open science", | ||
| "Metadata (standard)": "RDM, FAIR principles and open science", | ||
| "Controlled vocabulary, ontology, taxonomy, thesaurus": "RDM, FAIR principles and open science", | ||
| "Linked Open Data and SPARQL": "RDM, FAIR principles and open science", | ||
| "FAIR metrics": "RDM, FAIR principles and open science", | ||
| "3-point FAIRification Framework (FAIR data point, FAIR Implementation Profile)": "RDM, FAIR principles and open science", | ||
| "Persistent identifier": "RDM, FAIR principles and open science", | ||
| "Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH)": "RDM, FAIR principles and open science", | ||
| "Spreadsheet tools": "RDM, FAIR principles and open science", | ||
| "Data modelling": "RDM, FAIR principles and open science", | ||
| "Data integration": "RDM, FAIR principles and open science", | ||
| "Data integrity, validation & quality": "RDM, FAIR principles and open science", | ||
| "Data cleaning & wrangling": "RDM, FAIR principles and open science", | ||
| "Database management": "RDM, FAIR principles and open science", | ||
| "Master data management": "RDM, FAIR principles and open science", | ||
| "Business intelligence": "RDM, FAIR principles and open science", | ||
| "RDM service model": "RDM, FAIR principles and open science", | ||
| | ||
| "Research software": "Research software management", | ||
| "Software management planning": "Research software management", | ||
| "Software version control (e.g., git and GitHub)": "Research software management", | ||
| "Software documentation": "Research software management", | ||
| "Software packaging (R, Python, etc.)": "Research software management", | ||
| "Software citation": "Research software management", | ||
| "FAIR software": "Research software management", | ||
| "Reproducibility": "Research software management", | ||
| "Coding conventions": "Research software management", | ||
| "Literate programming": "Research software management", | ||
| "Scientific workflows and data pipelines": "Research software management", | ||
| "Computer programming": "Research software management", | ||
| "Virtual environments and Containerisation": "Research software management", | ||
| "Continuous integration": "Research software management", | ||
| "Use of generative Artificial Intelligence in writing research software": "Research software management", | ||
| | ||
| "(Certified) Data Repositories": "Data infrastructure", | ||
| "Repository quality standards (e.g. CoreTrustSeal, ISAD(G), OAIS reference model)": "Data infrastructure", | ||
| "TRUST principles for digital repositories": "Data infrastructure", | ||
| "Tool criticism": "Data infrastructure", | ||
| "Data collection tools": "Data infrastructure", | ||
| "Data management services/tools": "Data infrastructure", | ||
| "Data storage (media)": "Data infrastructure", | ||
| "Data transfer tools": "Data infrastructure", | ||
| "Data analysis software/tools": "Data infrastructure", | ||
| "Cloud computing and High-performance computing": "Data infrastructure", | ||
| "Data security and Data classification": "Data infrastructure", | ||
| "Available RDM infrastructure and organisations": "Data infrastructure", | ||
| "European Open Science Cloud solutions": "Data infrastructure", | ||
| | ||
| "European, national and institutional policies on RDM, RSM and open science": "Policy and governance", | ||
| "Funder RDM, RSM and open science requirements": "Policy and governance", | ||
| "Journal policies related to RDM": "Policy and governance", | ||
| "Policy development": "Policy and governance", | ||
| "Policy implementation": "Policy and governance", | ||
| "Policy monitoring": "Policy and governance", | ||
| "Translating policy to organisational strategy": "Policy and governance", | ||
| "Responsible metrics (bibliometrics, altmetrics)": "Policy and governance", | ||
| "Digital sovereignty": "Policy and governance", | ||
| "Data governance": "Policy and governance", | ||
| "Data ownership": "Policy and governance", | ||
| "Key Performance Indicators (KPI) for RDM": "Policy and governance", | ||
| | ||
| "Privacy and data protection (GDPR, UAVG)": "Legal and ethical responsibilities", | ||
| "Sensitive data/Confidentiality": "Legal and ethical responsibilities", | ||
| "Intellectual property rights": "Legal and ethical responsibilities", | ||
| "copyright": "Legal and ethical responsibilities", | ||
| "patents": "Legal and ethical responsibilities", | ||
| "trademarks": "Legal and ethical responsibilities", | ||
| "Research in consortia": "Legal and ethical responsibilities", | ||
| "Data and software licenses ": "Legal and ethical responsibilities", | ||
| "License compatibility": "Legal and ethical responsibilities", | ||
| "Information security": "Legal and ethical responsibilities", | ||
| "Knowledge security": "Legal and ethical responsibilities", | ||
| "European data legislation": "Legal and ethical responsibilities", | ||
| "AI Act": "Legal and ethical responsibilities", | ||
| "Data Governance Act": "Legal and ethical responsibilities", | ||
| "Data Act": "Legal and ethical responsibilities", | ||
| "European Health Data Space": "Legal and ethical responsibilities", | ||
| "(Cyber)security legislation, e.g. NIS2 Directive": "Legal and ethical responsibilities", | ||
| "Trade Secret Protection Act": "Legal and ethical responsibilities", | ||
| "Research ethics and integrity": "Legal and ethical responsibilities", | ||
| "CARE principles": "Legal and ethical responsibilities", | ||
| "Diversity, equity & inclusion": "Legal and ethical responsibilities", | ||
| | ||
| "Needs assessment": "Training and awareness raising", | ||
| "Instructional design": "Training and awareness raising", | ||
| "FAIR-by-Design methodology": "Training and awareness raising", | ||
| "Carpentries methodology": "Training and awareness raising", | ||
| "Training andragogy ": "Training and awareness raising", | ||
| "Didactic methods": "Training and awareness raising", | ||
| "Presentation skills": "Training and awareness raising", | ||
| "Written communication skills": "Training and awareness raising", | ||
| "Open Educational Resources": "Training and awareness raising", | ||
| "Diagnostic, formative and summative assessment": "Training and awareness raising", | ||
| "Course evaluation ": "Training and awareness raising", | ||
| "Student satisfaction": "Training and awareness raising", | ||
| | ||
| "Networking skills": "Transversal skills", | ||
| "Community management": "Transversal skills", | ||
| "Existing RDM networks/ communities": "Transversal skills", | ||
| "Consultancy": "Transversal skills", | ||
| "Advocacy": "Transversal skills", | ||
| "Conflict resolution": "Transversal skills", | ||
| "Negotiating": "Transversal skills", | ||
| "Active listening": "Transversal skills", | ||
| "Stakeholder analysis": "Transversal skills", | ||
| "Stakeholder engagement": "Transversal skills", | ||
| "Organisational development": "Transversal skills", | ||
| "Project management (methodologies)": "Transversal skills", | ||
| "Change management": "Transversal skills", | ||
| "Binding Leadership": "Transversal skills", | ||
| "Facilitation": "Transversal skills", | ||
| "Teamwork": "Transversal skills" | ||
| } |
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,27 @@ | ||
| { | ||
| "post-docs": "researchers", | ||
| "PhD's candidate": "researchers", | ||
| "PhD student": "researchers", | ||
| "principal investigator": "researchers", | ||
| "professor": "researchers", | ||
| "scientist": "researchers", | ||
| | ||
| "library staff": "research support staff", | ||
| "research librarian": "research support staff", | ||
| "information specialist": "research support staff", | ||
| "archivist": "research support staff", | ||
| "repository manager": "research support staff", | ||
| "data steward": "research support staff", | ||
| "data manager": "research support staff", | ||
| "data professional": "research support staff", | ||
| "data engineer": "research support staff", | ||
| "software engineer": "research support staff", | ||
| "data librarian": "research support staff", | ||
| | ||
| "bachelor": "students", | ||
| "master": "students", | ||
|
|
||
| "teacher": "trainers", | ||
| "coaches": "trainers", | ||
| "educator": "trainers" | ||
| } |
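
A minimal sketch of how a keyword-to-category mapping like the one above could be applied to free text. This is not code from the PR: `match_categories`, `AUDIENCE_MAPPING`, and the sample descriptions are hypothetical, and only a few entries of the mapping are copied. It assumes case-insensitive substring matching, which is also what the `auto_parse` method in this PR uses.

```ruby
# Hypothetical subset of the audience mapping shown above.
AUDIENCE_MAPPING = {
  'PhD student' => 'researchers',
  'professor' => 'researchers',
  'data steward' => 'research support staff',
  'master' => 'students',
  'teacher' => 'trainers'
}.freeze

# Returns the unique categories whose keyword occurs in the text
# (case-insensitive substring match). Returns [] when text is nil.
def match_categories(mapping, text)
  haystack = text.to_s.downcase
  mapping.select { |keyword, _| haystack.include?(keyword.downcase) }.values.uniq
end

p match_categories(AUDIENCE_MAPPING, 'Workshop for PhD students and data stewards')
# => ["researchers", "research support staff"]
p match_categories(AUDIENCE_MAPPING, 'An introduction to master data management')
# => ["students"]
```

The second call shows a limitation of plain substring matching: the keyword "master" also matches inside "master data management", so short keywords can produce false positives.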
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,23 @@ | ||
| module Ingestors | ||
| module AutoParsing | ||
| def auto_parse(var, description) | ||
| @auto_parse_cache ||= {} | ||
| json_path = File.join(Rails.root, 'lib', 'ingestors', 'auto_parser_mappings', "#{var.to_s}.json") | ||
| return nil unless File.exist?(json_path) | ||
| | ||
| mtime = File.mtime(json_path) | ||
| cached = @auto_parse_cache[var] | ||
| if cached && cached[:mtime] == mtime | ||
| mapping = cached[:mapping] | ||
| else | ||
| mapping = JSON.parse(File.read(json_path)) | ||
| @auto_parse_cache[var] = { mtime: mtime, mapping: mapping } | ||
| end | ||
| | ||
| mapping | ||
| .select{ |key, val| description&.downcase&.include?(key.to_s.downcase) } | ||
| &.values | ||
| &.uniq | ||
| end | ||
| end | ||
| end |
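
The caching step in `auto_parse` can be tried outside Rails with a sketch like the following. It is an illustration under assumptions, not the PR's implementation: `MappingCache` and the file name `example.json` are hypothetical, and the cache is keyed by path rather than by `var`. It shows the same idea: parse the JSON once and reload only when the file's modification time changes.

```ruby
require 'json'
require 'tmpdir'

# Hypothetical stand-alone loader: caches parsed JSON per path and
# reloads it only when the file's modification time changes.
class MappingCache
  def initialize
    @cache = {}
  end

  # Returns the parsed mapping, or nil if the file does not exist.
  def load(path)
    return nil unless File.exist?(path)

    mtime = File.mtime(path)
    entry = @cache[path]
    return entry[:mapping] if entry && entry[:mtime] == mtime

    mapping = JSON.parse(File.read(path))
    @cache[path] = { mtime: mtime, mapping: mapping }
    mapping
  end
end

Dir.mktmpdir do |dir|
  path = File.join(dir, 'example.json')
  File.write(path, JSON.generate({ 'teacher' => 'trainers' }))

  cache = MappingCache.new
  first = cache.load(path)
  second = cache.load(path)
  puts first.equal?(second) # true: the second call is served from the cache

  # Rewrite the file and push its mtime forward so the cache entry is stale.
  File.write(path, JSON.generate({ 'coaches' => 'trainers' }))
  File.utime(Time.now, File.mtime(path) + 10, path)
  puts cache.load(path).keys.inspect # ["coaches"]
end
```

Comparing modification times means an edited mapping file is picked up without restarting the process, at the cost of one `File.mtime` call per lookup.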