Content mining, machine learning, text and data mining (TDM) and data analytics all refer to the process of obtaining information through machine-read material. Faster than a human possibly could, machine-learning approaches can analyze data, metadata and text content; find structural similarities between research problems in unrelated fields; and synthesize content from thousands of articles to suggest directions for further research explorations. In consideration of the continually expanding volume of peer-reviewed literature, the value of TDM should not be underappreciated. Text and data mining is a useful tool for developing new scientific insights and new ways to understand the story told by the published literature. Continue reading “Unrestricted Text and Data Mining with allofPLOS”
Data management has become an increasingly discussed topic among the academic community. Managing data is an element of open science, which has proven to increase dissemination of research and citations for journal articles. Open science increases public access to academic articles, mostly through preprint repositories. Indeed, according to this study, open access (OA) articles are…
Watch #scidata17 LIVE now – it’s been great so far and we’ve got loads more fantastic #openscience talks to come! https://www.facebook.com/scientificdata/videos/1410989405696093/ … pic.twitter.com/GSnCFO5Q55
Hindawi’s CEO, Paul Peters, explains the problems inherent in proprietary solutions for Open Science infrastructure and presents a proposal for how things can be done differently. Continue reading “A radically open approach to developing infrastructure for Open Science”
Joe Walsh Author Marketing, Wiley Time flies when you’re having (open) fun. Open Access Week 2017, the 10th International Open Access Week, is here!
As a proud co-founder of Open Access Week, we hope you will join us in celebrating progress and promoting awareness to help make Open Access – the founding principle of PLOS – the new norm in scholarship and research globally. This year’s theme is “Open in Order to…” and invites the community to focus on what openness enables.
Since PLOS’ beginning, we’ve been open in order to accelerate progress in science and medicine through publishing, advocacy and innovation to benefit the research community and beyond. PLOS is open in order to ensure that: research outcomes are discoverable, accessible and available for discussion; science communication is constructive, transparent and verifiable; and publishing advances reproducibility, transparency and accountability. Continue reading “Open Access Week 2017 – Open in Order to…”
A search for truth drives scientific research. One of the tests for research findings is that it must be reproducible research. If you are the only person who can observe a result then that result is invalid. It is important that members of the scientific community invest in replicating original research. As more and more … Read more
Scientific research generates an immense amount of data. It is critical to not only make this data available to research community but also use it to decide on the next experimental question. Not all members of the academic publishing industry are in support of this kind of open access, which means that data remains trapped … Read more
Providing an analysis of Open Data and Open Science policies across Europe, has released a new report. Produced in collaboration with the Digital Curation Center (DCC), it follows on the heels of a previous work that listed national research data policies.
Early career researchers (ECRs) are the largest community of researchers but despite this we know little about their scholarly attitudes and behaviours. Reporting the first-year findings of a longitudinal study of an international panel of ECRs, Dave Nicholas reveals that many remain conservative in their scholarly attitudes and practices. ECRs are concerned by “risky” open peer review, regard archiving their work in repositories as a non-priority, and display little interest in open science or altmetrics. Many ECRs see opportunities for change, but do not feel able to grasp them as they are shackled to a reputational system that promotes publication record and citation scores above all else.
This blog post is based on the author’s co-written article, “Early career researchers: Scholarly behaviour and the prospect of change”, published in Learned Publishing (DOI: 10.1002/leap.1098).
Mozilla fellowships for researchers who want to influence the future of open science and data sharing: http://bit.ly/2oL00aS
Modern day research practice is incredibly collaborative, increasingly interdisciplinary and a very social process. Sierra Williams underlines the importance of researchers and publishers alike recognising publication as one aspect of a much wider social process. By way of introduction to her role at peer-reviewed open access publisher PeerJ, she reflects on the purpose of community in science communication.
The promise of open science to improve the speed, transparency and completeness of research sharing has attracted a lot of innovators and developers creating new, open source technology solutions. All too often, though, technologies are built by organizations that see themselves as competitive with one another and work at cross purposes.
We’re focusing on changing this culture. That may seem a strange statement from a Foundation whose initial work has already launched open infrastructure projects such as PubSweet and INK, but bear with us. Coko is working to seed a new ecosystem of open source projects, tools and platforms that work together.
We envision building an evolving network of modular, interoperable, flexible and reusable open source projects that facilitate rapid, transparent and reproducible research and research communication for the public good. Rather than remaining independent and siloed, these projects will share resources and learn from each other, creating an open science infrastructure. Coko is striving to create a healthy ecosystem of projects that can thrive and work with each other to solve the many problems and opportunities that face STEM publishing today.
Our first small step in this direction — which we see as a giant leap — is pulling together complementary projects to create an Open Source Alliance for Open Science. This federation will actively work together to form the ecosystem, agreeing on best practices that emphasize generosity and openness. The idea is to create a common pool of resources whose development is driven by community needs. Code is shared, so are tips for funding applications, report writing and outreach (etc).
An apt analogy is a community garden: plants that grow well together in common soil are seeded, grown, harvested, shared and plowed back into the land. Individual “plots” may be tended by the gardeners who are most adept at cultivating the seedlings, yet cross-pollination and resource sharing where appropriate are encouraged. The gardeners work in a common space, find territorial solutions and share fruits of the “harvest.”
One example of how we are prototyping this process is with the Substance Consortium, we helped found along with the Public Knowledge Project (PKP), SciELO and Érudit in 2016. Consortium members all use (or intend to use) the open source Texture editor, which helps publishers improve structured documents without having to mess with the underlying markup of XML (extensive markup language). The Consortium started as a way to recognize that organizations using the tools as critical infrastructure have a responsibility to contribute to their upkeep. To that end, Coko has played a foundational role in establishing the consortium, as well as putting energy and funds that contribute to the sustainability of Substance and the codebase.
As another example we introduced the innovative new project, Stencila, to funders — and then stepped aside. Typically, in a competitive environment, smaller projects that are desperate for initial funds may be co-opted by larger ones who overshadow the smaller organization and take a large cut of the funding. The larger project may vacuum up the credit without adhering to attribution best practices. Instead, in a demonstration of good faith, we coached Stenci.la through the funding process and made the direct introductions to funders. Stepping aside to enable Stenci.la to operate as they need to, with the funds they need, and receive the recognition they duly deserve.
Our efforts to cultivate these projects differ from the typical competitive model where organization see what others are doing, then throw shade on the newcomers by claiming to be building the exact same thing. This land grab results in whoever has the superior budget, PR and grant-writing staff, and stronger name recognition “winning,” whether or not they intend to actually create the product, build it well, or share it in a meaningful way. This highly competitive landscape discourages healthy open source communities forming around projects and meaningful, productive, inter-project collaboration.
The garden model will give smaller projects a chance to thrive and grow so as to avoid being co-opted or plowed under. This will create a more diverse and rich ecosystem, since many of these projects arise out of specific expertise that larger projects may not have.
To lay the groundwork for this Alliance, we’re planning a meeting May 1 in Portland, along with founding partners DAT, the Code for Science & Society (CSS) and The California Digital Library (CDL). By meeting in person, discussing initiatives and directly collaborating, we seek to generate buy-in on shared goals and open direct lines of communication between organizations. The initial meeting will garner support for shared goals and values and establish a self-sustaining community with firm attendee commitments to continue the conversation. If you’d like to participate email us at firstname.lastname@example.org