| |
|
|
|
|
NEWSLETTER | February 2025 STRUCTURING DATA AND CONTENT SINCE 1981
|
|
|
|
|
|
|
|
|
|
Genius Without the Gibberish: RAG and Structured Content
|
Retrieval-augmented generation (RAG) is a technique that allows large language models (LLMs) to incorporate external information from a corpus of documents during the text generation process. RAG mitigates the hallucination problem by grounding LLM responses in verified data sources. The key components of text analytics—text preprocessing, natural language processing, entity extraction, and structuring data—play a critical role in this process. These techniques ensure that data fed into an LLM is clean, relevant, and structured to maximize the accuracy and reliability of the responses. By leveraging organizational knowledge and domain-specific datasets, RAG significantly enhances the performance and trustworthiness of LLM outputs.
|
|
|
|
|
|
NISO STS: The Standards' Standard
|
NISO STS is an XML-based standard developed by the National Information Standards Organization to structure and tag standards-related content. The American Water Works Association partnered with DCL to convert over 22,000 pages of standards into NISO STS XML in just 8 months, enhancing digital accessibility and paving the way for innovative product development. This transformation streamlined their publishing workflow, ensuring long-term content interoperability and discoverability. By leveraging automation and expert oversight, AWWA achieved remarkable efficiency without compromising accuracy. Read the full case study to see how structured content can revolutionize industry standards.
|
|
|
|
|
|
The Rosenblum Award for Scholarly Publishing Impact
|
|
Last week NISO, the Association of Learned and Professional Society Publishers (ALPSP), the Association of University Presses (AUPresses), the Society for Scholarly Publishing (SSP), and the International Association of Scientific, Technical & Medical Publishers (STM) announced the launch of The Rosenblum Award for Scholarly Publishing Impact. The award is named in memory of publishing technology expert Bruce Rosenblum and celebrates innovations that have transformed the scholarly publishing ecosystem, focusing technologies, standards, or practices that have become indispensable to its operation. Read more to see the recipient of this inaugural award.
|
|
|
|
|
|
Living Languages Data: How Many Languages are Spoken in Each Country
|
Our World in Data is a project of Global Change Data Lab, a nonprofit based in the UK (Reg. Charity No. 1186433). Its mission is to publish the "research and data to make progress against the world's largest problems." Our World in Data publishes tools and software under the MIT license. A recent data source from the Summer Institute of Linguistics (SIL) International gathers data on living languages in each country around the world. A living language is one that is spoken by at least one person as their first language. Explore the data collected and interact with its representation via graphs, tables, and charts. Can you guess the country with the most living languages?
|
|
|
|
|
|
|
|
SES Annual Meeting - March 19 to 21, 2025 | New Orleans, LA
The Society for Standardization Professionals’ Annual Conference is a premier event that brings together experts and enthusiasts from various industries to discuss the latest trends and advancements in standardization. [LEARN MORE] ConVEx - April 7 to 9, 2025 | San Jose, CA
DCL is exhibiting
ConVEx
is an immersive experience for content developers. Now in its 27th year,
this event offers ideas and information to support organizational
content strategy. [LEARN MORE]
Aviation Week MRO Americas - April 8 to 10, 2025 | Atlanta, GA DCL is exhibiting
MRO Americas is the world's largest gathering of the aviation maintenance community. This premier event brings together industry leaders, innovators, and experts to explore the latest trends, technologies, and strategies in commercial aviation maintenance, repair, and overhaul (MRO). [LEARN MORE]
|
|
|
|
DCL partners with many global organizations that complement our services and offer a complete workflow solution to our customers. Following are some recent highlights from DCL's Partnership Laboratory.
|
|
|
|
Why consider an OEM XML editor instead of building one yourself?
In today’s fast-paced business environment, product managers and decision-makers face tough choices when designing end-to-end solutions for structured content management. One such critical decision revolves around whether to build an XML editor from scratch or integrate a ready-made OEM solution. At Fonto, we believe the advantages of embedding an OEM XML editor far outweigh the challenges of building one in-house. Here’s why.
|
|
|
|
|
|
Journal Performance Analysis – Redefining journal success in a competitive era
The academic community is increasingly focusing on alternative metrics and deeper bibliometric insights, as funders and institutions shift their priorities to assess the broader impact of research. Staying competitive in this environment requires more than just maintaining impact factors; it necessitates tracking emerging trends in science, identifying underrepresented areas, and strategically aligning with them. That’s why Maverick developed Journal Performance Analysis (JPA), a straightforward approach to addressing key challenges in publishing today.
[READ MORE]
|
|
|
|
|
|
Avoid These 5 Localization Mistakes
Localization is an essential process for businesses seeking to expand globally or into new regional markets. However, many companies embarking on this journey neglect best practices, unintentionally hindering success. Whether you need guidance on website localization, apps, or something else, it’s crucial to understand and avoid issues that can derail your progress. By being aware of these localization mistakes, explained by the TransPerfect team, you’ll be better positioned to create a seamless, effective experience for your customers—in any language.
[READ MORE]
|
|
|
|
|
|
The Wild World of the Web in 1996
In the fall of 1996 Netscape Communications Corporation unveiled an integrated client/server software solution for "a new era of open email and groupware as rich as the Web." The new Netscape Intranet solutions enabled corporate customers to build and manage "full service Intranets using the new Netscape SuiteSpot 3.0 integrated server suite." What's truly wild (and confusing from the modern perspective) is how fragmented the tools and portals were in the early days. [READ MORE]
|
|
|
|
|
|
|
|
|
|
|
|
|