- April 15, 2019
Events are as important as data in emerging applications underlying many e-commerce efforts. Streams of events tell a company what motivates customers to use online products.
- April 04, 2019
Tools such as Unravel and Pepperdata offer a way to measure performance of big data cloud applications, which may aid companies with on-premises configuration issues.
- March 25, 2019
Startups Interana and Rockset differ in their approaches to providing new query capabilities on fast-arriving big data. Both are led by technologists who started at Facebook.
- March 25, 2019
Experts say data professionals should work to create a common vocabulary in organizations to help boost data governance and compliance with laws like GDPR.
- March 21, 2019
Machine learning will bring change to analytics and data management, said data luminary Michael Stonebraker. Others agree managing such change will take special effort.
- March 15, 2019
Many data professionals have yet to solidify traditional data management practices, but they have a new set of challenges to overcome to ensure data privacy and avoid misuse.
- March 12, 2019
Apache Kafka and Apache Spark connectors ease use of the Aerospike NoSQL data store in high-speed applications such as analytics that are becoming more broadly supported.
- March 11, 2019
Data catalogs form a hub for managing enterprise data. New products focus on machine learning and AI add-ons that help automate aspects of data governance.
- February 26, 2019
An in-memory data grid (IMDG) from Hazelcast lets designers tune subsystems to favor consistency over availability, or the reverse, depending on their needs.
- February 14, 2019
The Presto engine arose as an alternative to Hive for big data queries. Now, the Presto Software Foundation has formed to promote the SQL query software's virtues.
- February 11, 2019
StoryFit data scientists employ machine learning algorithms to gauge film script scenarios' prospects. They use Import.io tools to make data preparation easier.
- February 01, 2019
Cloud giants like AWS have adopted open source databases, causing Confluent, MongoDB and others to guard their assets the best way they know how: licensing.
- January 22, 2019
Cloud architecture, analytics and AI data processing are top innovation priorities for new Teradata CEO Oliver Ratzesberger. He talks about his goals in this Q&A.
- January 21, 2019
The Data.gov shutdown shows that, as open data can be turned off, data professionals may need to consider alternative sources for the kinds of data the government offers.
- January 15, 2019
Two wunderkinds of Hadoop have formalized their merger. Cloudera and Hortonworks say they will place special focus on AI as they chart the stand-alone vendor's future.
- January 10, 2019
IBM CEO Ginni Rometty pushed real-time weather data mining for travel and other uses, even as the Los Angeles city attorney is suing IBM's Weather Company unit over its sharing of location data with business partners.
- December 28, 2018
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say.
- December 18, 2018
Datawatch plans to add analytics and data visualization tools to its Swarm data preparation platform, starting with integration capabilities in a Swarm 2.2 release.
- December 14, 2018
Third-party vendors that offer data platforms to AWS users tout hedges against cloud lock-in. But they must both compete and collaborate with the cloud leader.
- November 30, 2018
Built-for-purpose databases target general ones, especially in the cloud. At re:Invent 2018, AWS added time-series and transaction ledger databases to expand its line.
- November 27, 2018
A new Microsoft Azure SQL Database Managed Instance service seeks to span the gap between cloud and on-premises systems. Meanwhile, Oracle also has cloud plans of the data kind.
- November 06, 2018
The IBM acquisition of Red Hat marks a watershed in computer architecture. The duo says it can rebuild data applications in new ways. This news analysis explores what's coming.
- October 12, 2018
MarkLogic rolled out a cloud-service version of its NoSQL database management system, a move designed to make the technology more cost-effective for cloud users.
- October 04, 2018
Hadoop users will have fewer choices as big data rivals Cloudera and Hortonworks unite. But the new company may be more competitive with AWS and Google.
- September 26, 2018
BI on Hadoop is still new, but moving BI to data is trending. A data scientist working with IoT data at Komatsu sees the importance of getting big data to the right people.
- September 13, 2018
Hortonworks is joining with Red Hat and IBM on a hybrid big data architecture format that will run in containers both in the cloud and on premises.
- September 07, 2018
Hadoop data tooling is expanding. A view holds that Hadoop is moving from alternate data warehousing to a full-fledged big data analytics offering.
- September 04, 2018
A graph database startup's parallel loading, processing and querying combine to deliver real-time data for fintech firms that offer fast credit evaluations online.
- August 08, 2018
Confluent Platform updates seek to bring data streaming with Apache Kafka to a wider audience. A new GUI and user-defined functions are part of the 5.0 release.
- July 19, 2018
Chief data officers and experts see the CDO role as changing to a more strategic orientation -- especially finding key opportunities in vast troves of data.
- July 16, 2018
The chief data officer role is about many things -- regulations, innovation, AI and more. Consultant Randy Bean discussed the matter ahead of an MIT symposium on the topic.
- July 02, 2018
GDPR influence is touching a Hadoop big data world that was immune to many privacy considerations until now. This podcast features the rise of Hadoop data governance for data lakes.
- June 27, 2018
NoSQL vendor MongoDB upgraded its database software with ACID support, while also releasing a serverless platform intended to simplify application development.
- June 25, 2018
Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users.
- June 18, 2018
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform.
- March 21, 2018
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate deployments and enable more flexible use of computing resources.
- March 15, 2018
StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the European Union's GDPR deadline looms.
- February 22, 2018
In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer.
- February 16, 2018
MongoDB is taking a deeper step into SQL-style processing waters with a 4.0 update that brings increased support for ACID-compliant transactions to its NoSQL database.
- January 03, 2018
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems.
- December 01, 2017
After Cyber Monday, Amazon Web Services techies headed for re:Invent 2017. There, among a deluge of cloud product announcements by AWS, the Amazon Neptune graph database surfaced.
- December 01, 2017
Born at Cloudera, the MPP query engine known as Apache Impala has become a top-level open source project. It's one of various tools bringing SQL-style interactivity to big data analytics.
- October 30, 2017
The Neo4j graph database emphasizes easy relationship mapping for diverse data points. Now, its related Cypher query language is hooking into Apache Spark.
- October 05, 2017
At the Strata conference in New York, IT managers detailed steps they're taking to improve data quality in their big data environments in order to help ensure analytics accuracy.
- September 28, 2017
The Strata conference in New York saw big data platform vendor MapR Technologies update MapR-DB, its NoSQL database engine, to better perform in real-time analytics applications.
- September 26, 2017
ETL jobs -- once the sole province of IT -- take on a new form as data wrangling and self-service gain greater traction with business users of analytics.
- September 06, 2017
Breitburn Energy Partners employed data quality tools to address the business pain of bad data, using the software to give end users the means to fix data quality issues themselves.
- August 31, 2017
SQL on Hadoop arrived -- so did SQL on Spark. Now, SQL on Kafka is emerging to provide a different way to look at Kafka data as it streams through the enterprise.
- August 30, 2017
In this Talking Data podcast, TechTarget editors discuss Hadoop's future, IBM's decision to resell the Hortonworks distribution of the open source technology and other big data issues.
- July 31, 2017
Data management startup Dremio has aimed its Apache Arrow expertise at the problem of self-service data delivery. In-column caches and optimization speed queries across varied data stores.
- July 24, 2017
In many organizations, chief data officer jobs centered on defense against risk are giving way to ones emphasizing innovation. To do so, CDOs must nurture a data culture, MIT panelists said.
- July 10, 2017
MongoDB targets better dashboard visualization with MongoDB Charts, which gives business users another way to look into their NoSQL data pools.
- June 30, 2017
The quest for the agile database is putting developers in the forefront and has some DBA tasks moving to the development groups, according to panelists at a conference in Boston.
- June 29, 2017
With the EU's new General Data Protection Regulation looming on the horizon, companies -- including many in the U.S. -- need to get going on required data governance upgrades.
- June 23, 2017
MongoDB has expanded cloud coverage for its Atlas hosted database service, with Azure and Google versions joining an initial AWS-based offering to give users a choice on cloud platforms.
- June 20, 2017
IBM pulled the plug on its distribution of Hadoop in favor of reselling Hortonworks' bundle of big data technologies, a decision that reduces the number of Hadoop vendors to four.
- May 31, 2017
Deep learning applications often require a mix of data, and assorted preprocessing techniques. That makes data preparation a priority, and conventional machine learning may have a role to play.
- May 12, 2017
Kafka is a linchpin in many on-premises big data pipelines. Now, software vendor Confluent is offering a Kafka cloud service to ease use of the messaging and data streaming system in the cloud.
- April 28, 2017
Data lakes offer a more expansive alternative to data warehouses for analytics uses. TDWI analyst Philip Russom offers advice on how to get things right in a data lake architecture.
- April 28, 2017
Systems of engagement represent a hotbed of activity in data management these days. Flexibility and scalability are watchwords.
- April 20, 2017
Corporate users are becoming more open to deploying big data systems with Apache Spark in the cloud, Databricks CEO Ali Ghodsi says in a Q&A on the open source processing platform.
- April 14, 2017
Software containers encapsulate complexity and ease deployment, two traits that are helping to elicit growing interest in using them as part of big data systems.
- March 31, 2017
Fitness company Beachbody set up a data lake system in the AWS cloud to support big data analytics applications after deciding that an on-premises deployment would be too complicated.
- March 24, 2017
Blockchain data technology disruption may be in the offing. IDC's Stewart Bond says architecture at the core of controversial bitcoin may show a new path to data integrity.
- March 10, 2017
Businesses constantly need to evolve their programs for governing data. Nationwide's finance data governance team shares how it stepped up data governance strategy and processes.
- March 08, 2017
Application profiling software from Pepperdata is built on LinkedIn's Dr. Elephant open source entry. A primary goal is to get more Hadoop and Spark applications into production.
- February 21, 2017
Moving custom Spark and Hadoop pilot projects into production use has proved daunting. But container technology eased the transition at the Advisory Board analytics service.
- February 16, 2017
Spark Streaming architecture to date has focused much on programming perks. Now, as a bit of a hedge against other streaming choices, Drizzle comes to bat to cut streaming latency.
- February 06, 2017
Predictive models help Jewelry Television's on-air hosts sell its wares, thanks to data integration and preparation processes that funnel a mix of data into the analytics applications.
- February 06, 2017
Data scientists building predictive models and machine learning algorithms often have to do more data preparation work upfront than is necessary in conventional analytics applications.
- February 06, 2017
Increased automation of data pipelines and more flexibility for data scientists through self-service software are taking hold as big data deployments change data preparation practices.
- February 03, 2017
The head of Kaiser Permanente's data governance program says data stewards hold the key to the initiative's success, and he offers advice on managing data stewardship processes.
- January 31, 2017
Big data analytics and digital transformation challenge the conventional data governance process, with many questions to answer in organizations. But a governed data lake shows how it can be done.
- December 02, 2016
Amazon's Athena data engine brings interactive SQL queries to S3 data sets and lets users pay as they go. It's based on an open source framework called Presto that Teradata and others also employ.
- November 30, 2016
The Louisiana Department of Health responded to flooding with the help of GIS software that located trouble spots with at-risk hospitals. Ease of use was welcome, according to a preparedness manager.
- October 28, 2016
Big data is moving from its bare-metal roots, and data streaming is a driver. Containers and microservices may have a role to play in what's next. An e-commerce application shows the way.
- October 06, 2016
What's in your toolbox? October's issue of Business Information turns the tables and puts that burning question to Capital One and several other business intelligence and data analytics software users. As the burgeoning worlds of ...
- September 30, 2016
Users increasingly are eyeing the cloud for big data management and analytics applications, and IT vendors are moving to ease the process -- and the price -- of running Hadoop in the cloud.
- September 29, 2016
Among a handful of new SQL-oriented, in-memory databases is MemSQL. Recent product updates are meant to improve data pipeline creation and performance in high-speed ingestion applications.
- September 15, 2016
HPE is paring down its software holdings, including analytical database software in the Vertica line and other big data tools. A sale to Micro Focus is due to close next year, leaving users in some limbo for now.
- August 31, 2016
Vertica 8.0 expands the analytical database's support for Kafka, Spark and Hadoop. That's an important step, as the Hewlett Packard Enterprise technology tries to compete in a field of diverse data tools.
- August 30, 2016
Cloud data warehouse offerings from smaller vendors seek to address functionality gaps that bigger players may miss. Newcomer Snowflake Computing targets concurrent queries, for example.
- August 23, 2016
Managed data services are growing in use, as types of data stores proliferate and the cloud becomes the home for more data. DevOps is a driver behind the changes, which bring new duties and needed skills for DBAs.
- July 22, 2016
How to balance data safety with innovative big data expansion was at issue at an MIT symposium where the chief data officer role was considered.
- July 15, 2016
Forces at work in data management have led to the advent of the chief data officer. The role of the CDO and more is discussed in a Q&A with consultant Joe Caserta.
- July 08, 2016
Speed to production underlies interest in MongoDB. At its annual confab, the company behind the NoSQL database rolled out improvements, including a Spark analytics link.
- July 06, 2016
Hadoop management is becoming a bigger priority for big data users and vendors alike as the distributed processing framework plays a more central role in the business operations of organizations.
- June 29, 2016
Thanks to ubiquitous mobile technology, field data is more readily accessible to ESRI's Survey123 for ArcGIS. A relief organization used the software to aid Syrian war refugees.
- June 24, 2016
DataStax Enterprise 5.0 couples a Cassandra column-family data store with a rewritten version of the open source Titan graph database. The goal is fast analytics closely tied to fast transactions.
- June 24, 2016
On the occasion of ComputerWeekly's 50th anniversary, Brian McKenna joins the Talking Data podcast crew to look back at Bletchley Park, and forward to Hadoop and AI.
- June 17, 2016
Graph technology is popping up in many places, including master data management. A major data integration player has joined the quest, as seen recently at Informatica World.
- June 09, 2016
Spark 2.0, with structured streaming and SQL 2003 support, is aborning as indicated at Databricks' Spark Summit, where R-to-Spark interfaces also popped up.
- June 03, 2016
Startup vendor Confluent is looking to place a stake in the big data ecosystem with Kafka streaming and management tools meant to reduce complexity in applications that place data in motion.
- June 01, 2016
Data wrapping -- in this case, bundling data and analytics services with products -- may entice more companies to become data businesses. A panel at an MIT symposium considered some best practices for doing so.
- May 19, 2016
New cloud apps seem ready-made for NoSQL. This may cause Oracle to put more focus on its Oracle NoSQL database, which is often overlooked amid a crush of NoSQL contenders.
- May 16, 2016
In an interview, consultant Lakshmi Randall foresees changes in how data management is organized and executed as the overall data landscape shifts due to the adoption of big data systems.
- April 29, 2016
Surging big data is changing data modeling techniques, including schema creation. The word from Enterprise Data World 2016: Data pros must adjust.
- April 29, 2016
Open source data engineering has become a way of life at e-commerce leader eBay, says the company's Debashis Saha. Kylin is one of the tools that has resulted.
- April 22, 2016
A new view on hybrid data architectures, in which data lakes and warehouses coexist, emerged at EDW 2016. The hybrid approach has implications for data design, skills and planning.
- April 19, 2016
Running a Hadoop cluster in the data center isn't for the weak. But several new tools aim to give IT operations teams a closer look into what's going on inside Hadoop-based big data systems.