Before deciding to use ID as a column name, consider this scenario when dealing with CSV files…
"SYLK: File format is not valid" error message when you open file
"SYLK: File format is not valid" error message when you open file
A dump of all things SQL Server, SQL Reporting Services, Analysis Services, around Business Intelligence, Business Performance Management and technology.
Before deciding to use ID as a column name, consider this scenario when dealing with CSV files…
"SYLK: File format is not valid" error message when you open file
"SYLK: File format is not valid" error message when you open file
Some information on SAP Data Services here, and a refresher on Databases and Data Warehousing.
Also, an interesting post on ETL vs ELT. Would it be faster to use ETL to transform your data before loading, or would it be faster to just copy your data directly to the warehouse and manage it inside?
Perhaps this is the role the new FileTable type in SQL 2012 will play for you. Why not FTP your CSV files directly into SQL as blobs, and manage the transformations within?
Maybe not the best example, but it does centralize things a bit more…
SQL Server 2012 is released this weekend. SQL 2012, you can now use select-style statements to return customized results from a single stored procedure.
EXEC uspGetEmployeeManagers 16 WITH RESULT SETS ( ([Reporting Level] int NOT NULL, [ID of Employee] int NOT NULL, [Employee First Name] nvarchar(50) NOT NULL, [Employee Last Name] nvarchar(50) NOT NULL, [Employee ID of Manager] nvarchar(max) NOT NULL, [Manager First Name] nvarchar(50) NOT NULL, [Manager Last Name] nvarchar(50) NOT NULL ) );
Custom paging using an ORDER BY clause
-- Skip 0 rows and return only the first 10 rows from the sorted result set. SELECT DepartmentID, Name, GroupName FROM HumanResources.Department ORDER BY DepartmentID OFFSET 0 ROWS FETCH NEXT 10 ROWS ONLY;
Throw to negate the need for custom messages with RAISERROR
C#/VB style Format to format dates, times, numbers
OVER clause for windowed summarization / rolling totals
PARSE to convert strings to dates, currencies, etc
TRY_CONVERT to avoid errors when converting data types
TRY_PARSE to determine if strings = data types
DATEFROMPARTS to take a year, month and day and create a date
CHOOSE to return an index item from a list of selections. (Arrays in SQL?)
Some helpful tools coming out of SQL MDS, including regex and XSLT straight from T-SQL.
I've been having a play with SQL 2008 R2 and its new Master Data Services (MDS). I was pleased to see it adds some additional functions when you create a new MDS database. These include mdq.Split and mdq.RegexReplace which implement handy Regular Expression functions. Most of these are documented at: http://msdn.microsoft.com/en-us/library/ee633712(SQL.105).aspx
I also discovered a scalar function called mdq.XmlTransform which allows you to use XSLT:
Tracking your social media campaign? Here are some tools to look at.
A couple years ago, Jay Baer wrote a great blog post called ‘The 39 social media tools I’ll use today’ which was an all-in-one toolkit for social media marketers (and still is).
A lot has changed in the two years since that post was published so here is a ’2012 remix’ featuring 50 (mostly free) tools you can use on a daily basis.
Whether you are just starting out in the social media arena or have been at it for a few years, this will hopefully be a handy resource. So, let’s serve ‘em up
Looking for a Windows 8 / HTML5 map control?
CanvasMap is a JavaScript map visualization control (similar to Bing Maps or Google Maps) that is built entirely on top of using the HTML5 Canvas element to render the map. Currently, neither Bing Maps or Google Maps use the Canvas element to render maps, so I thought I’d do this to see how the performance might compare. I know there are a few alternatives already, but I thought I’d have some fun trying to build my own.
One fork in the road may be a single platform for Windows Phone & Windows 8 apps, or the ability to run WP7.5 apps on Win8.
Which would be nice. Kind of like running iPhone apps on the iPad.
The star of last year's product roadmap was Office 365. Microsoft this year looks likely to pivot from the public cloud to the private cloud, while also shipping staples like SQL Server and releasing major betas -- and possibly final versions -- of blockbusters including Windows 8 tablet, client and server, and Office 15.
The 2012 Microsoft Product Roadmap -- Redmond Channel Partner
Need a fast way of getting lat/long values for a batch of addresses in Excel?
A lot of folks have to geocode data that they get from various folks in their organization, and it often makes its way in to Excel. Of course, you can batch geocode using Bing Maps Services, but in some scenarios it is a lot more convenient or reasonable to use Excel.
Don’t forget to ensure that your test cases contain time-based events, like leap years and Daylight Savings. Not that it would have helped much, since it was a cert expiry that caused the issue. The moral of the story? Ensure critical apps have a disaster recovery plan that doesn’t include a single source of failure (cloud provider).
When the clocks struck midnight, things quickly got janky, and a cloud-system domino effect took charge. A large number of Western Hemisphere sites and the U.K. government's G-Cloud CloudStore were among the many stopped cold by the outage. Microsoft has been retracing its steps in finding out what exactly happened and hasn't said very much yet, although it did report in an Azure team blog that the problem has "mostly" been fixed.
IT in Canada - Canada's Only Integrated Social Media News Network
NoSQL, or DIY SQL is the most common technique for blasting out distributed data stores.
This article is a great overview of the types of NoSQL designs out there, and ways to model the data.
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because specific non-functional properties are often the main justification for NoSQL usage and fundamental results on distributed systems like CAP theorem are well applicable to the NoSQL systems. At the same time, NoSQL data modeling is not so well studied and lacks of systematic theory like in relational databases. In this article I provide a short comparison of NoSQL system families from the data modeling point of view and digest several common modeling techniques.
I recently signed up for mailings from Analyticbridge, the social network for analytic professionals. It seems like daily emails are the norm, which I’ll have to setup a rule to move someplace. In the meantime, they just released a list of the top analytic blogs and websites.
Worth checking out.
Top analytic blogs and websites, with trending information - AnalyticBridge