Synopses & Reviews
While Web 2.0 was about data, Web 3.0 is about knowledge and information. Scripting Intelligence: Web 3.0 Information Gathering and Processing offers the reader Ruby scripts for intelligent information management in a Web 3.0 environment?including information extraction from text, using Semantic Web technologies, information gathering (relational database metadata, web scraping, Wikipedia, Freebase), combining information from multiple sources, and strategies for publishing processed information. This book will be a valuable tool for anyone needing to gather, process, and publish web or database information across the modern web environment.Text processing recipes, including speech tagging and automatic summarization Gathering, visualizing, and publishing information from the Semantic Web Information gathering from traditional sources such as relational databases and web sites What you?ll learn Gather and process information within the Web 3.0 environment. See the flexibility of scripting with Ruby to gather and process information. Extract text from various document formats. Work with the Resource Description Framework (RDF) data model and SPARQL query language, the foundations of the Semantic Web. Use GraphViz for data visualization. Extract information from relational databases and web sites. Who this book is for
Anyone needing to gather and display information available in electronic formats Programmers needing to tag, summarize, or publish information Ruby programmers and computer enthusiasts interested in seeing what Ruby can do with information management and Semantic Web tools Academic researchers needing to extract and organize information in a more automated way Table of Contents Parsing Common Document Types Cleaning, Segmenting, and Spell-Checking Text Natural Language Processing Using RDF and RDFS Data Formats Delving Into RDF Data Stores Performing SPARQL Queries and Understanding Reasoning Implementing SPARQL Endpoint Web Portals Working with Relational Databases Supporting Indexing and Search Using Web Scraping to Create Semantic Relations Taking Advantage of Linked Data Implementing Strategies for Large-Scale Data Storage Creating Web Mashups Performing Large-Scale Data Processing Building Information Web Portals
This book covers Web 3.0 technologies from a software developer s point of view. While n- techies can use web services and portals that other people create, developers have the ability to be creators and consumers at the same time by integrating their work with other people s efforts. The Meaning of Web 3.0 Currently, there is no firm consensus on what Web 3.0 means, so I feel free to define Web 3.0 for the context of this book and to cover Ruby technologies that I believe will help you develop Web 3.0 applications. I believe that Web 3.0 applications will be small, that they can be constructed from existing web applications, and that they can be used to build new web applications. Most Web 3.0 technologies will be important for both clients and services. Web 3.0 software systems will need to find and understand information, merge information from different sources, and offer flexibility in publishing information for both human re- ers and other software systems. Web 3.0 applications will also take advantage of new cloud computing architectures and rich-client platforms. Web 3.0 also means you can create more powerful applications for less money by using open source software, relying on public Linked Data sources, and taking advantage of thi- party cloud hosting services like Amazon EC2 and Google App Engine."
Intelligent information management is the key to many Web 2.0 applications, and Semantic Web technologies like the SPARQL query language are helping Web 2.0 evolve into Web 3.0. This book addresses how to use Ruby not only for basic information management tasks, but also how to use it with Semantic Web tools like RDF Schema and SPARQL. This book offers practical advice and example code rather than getting bogged down in the academic and theoretical side of information management.
This book offers the reader recipes using Ruby for intelligent information management including information extraction from text, using Semantic Web technologies, information gathering (relational database metadata, web scraping, Wikipedia, Freebase), combining information from multiple sources, and strategies for publishing processed information. This book will be a valuable tool for anyone needing to gather, process, and publish web or database information.